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Abstract 



The problems of the identity of proofs, equivalences of reductions in term rewriting 
systems and coherence in categories all share the common goal of describing the notion of 
equivalence generated by a two-dimensional congruence. This thesis provides a unifying 
setting for studying such structures, develops general tools for determining when a congru- 
ence identifies all reasonable parallel pairs of reductions and examines specific applications 
of these results within combinatorial algebra. The problems investigated fall under the 
umbrella of "coherence" problems, which deal with the commutativity of diagrams in free 
categorical structures — essentially a two-dimensional word problem. It is categorical 
structures equipped with a congruence that collapses the free algebra into a preorder that 
are termed "coherent" . 

The first main result links coherence problems with algebraic invariants of equational 
theories. It is shown that a coherent categorification of an equational theory yields a 
presentation of the associated structure monoid. It is subsequently shown that the higher 
Thompson groups F n i and the Higman-Thompson groups G U) \ arise as structure groups 
of equational theories, setting up the problem of obtaining coherent categorifications for 
these theories. 

Two general approaches to obtaining coherence theorems are presented. The first 
applies in the case where the underlying rewriting system is confluent and terminating. 
A general theorem is developed, which applies to many coherence problems arising in 
the literature. As a specific application of the result, coherent categorifications for the 
theories of higher order associativity and of higher order associativity and commutativity 
are constructed, yielding presentations for F n \ and G n> i, respectively. 

The second approach does not rely on the confluence of the underlying rewriting system 
and requires only a weak form of termination. General results are obtained in this setting 
for the decidability of the two-dimensional word problem and for determining when a 
structure satisfying the weakened properties is coherent. A specific application of the 
general theorem is made to obtain a conceptually straightforward proof of the coherence 
theorem for iterated monoidal categories, which form a categorical model of iterated loop 
spaces and fail to be confluent. 
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CHAPTER 1 



Introduction 

Coherence problems arise in category theory when one wishes to describe the free 
algebra generated by a particular structure. Typically, this problem boils down to solving 
a sequence of word problems: Which functors are equal? Which natural transformations 
are equal? Which modifications are equal? And so on up the dimensions. Our main 
interest here is in two-dimensional categorical structures. Within this context, coherence 
problems are related to several other problems: When are two proofs of the same theorem 
equivalent? When do two interpretations of the same sentence assign the same meaning? 
When do two programs implement the same algorithm? In order to gain some insight into 
the importance and meaning of coherence problems, we explore the analogy with natural 
and artificial languages slightly deeper. 

A written language may be thought of as a collection of symbols together with rules 
for manipulating and combining them. A sequence of such symbols is called a sentence. A 
sentence is grammatical if it can be constructed via the rules of the language. 

Attempting to ascribe meaning to sentences of a language is potentially fraught with 
difficulty. For a simple mathematical language, such as arithmetic, the meaning of a 
sentence is abundantly clear — it is the natural number obtained by carrying out the 
described calculation. For more complicated constructions, such as natural language, the 
problem can be significantly more difficult. 

One typically wishes to assign a meaning to every possible grammatical sentence of a 
language. If one considers sentences to be completely independent of each other, then, for 
any reasonably complex language, one would need to decide on the meaning of infinitely 
many sentences. Such a task is unreasonable in practice. One way in which to resolve this 
situation is to suppose that the language is compositional. That is, that the meaning of a 
sentence is composed from the meaning of its subparts. It is important to note that two 
related claims are being made here. First, there is a collection of basic syntactic structures, 
which carry meaning. These can be words, such as "dog", "cat", "table", "chair" etc., or 
they may be more complicated phrases or sentences. The second claim is that the meaning 
of a sentence built from these basic pieces is a composition of the meanings of the pieces. 
This compositionality principle is appealing on a number of levels, not least of all because it 
provides a reasonable explanation for a person's ability to comprehend sentences that they 
hear for the first time. A more technical reason is that one can show that any recursively 
enumerable language can be captured by some compositional grammar [Jan96]. 
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Within richly expressive languages, there is the potential for structural ambiguity. 
That is, a given sentence may have two distinct meanings even though the meanings of 
the individual words remains constant. For example, the sentence "The shooting of the 
hunters was terrible" may mean that the hunters had terrible aim, or that it was a shame 
that the hunters were shot. Within the framework of compositionality, the two meanings 
could only have arisen from composing the words in a different manner. 

When designing a computer programming language, one typically wishes to avoid the 
presence of any structural ambiguity. More technically, any two proofs ( "compositions" ) of 
the same typing judgement ("sentence") must carry the same meaning [CG90, Rey91]. 

A language that contains no structural ambiguity whatsoever is termed "coherent". 
The name stems from Mac Lane's construction of a coherent language for a monoidal 
structure on a category [ML63], which is the real starting point for this thesis. 

Mac Lane [ML 76] attributes the motivation for his development of the theory of 
monoidal categories to a question of Norman Steenrod: When is there a canonical map 
between two specified formal combinations of modules? Steenrod was considering the 
category of all modules over a commutative ring and the combinations of such modules by 
applying the functors ® and Horn. Monoidal categories abstract the structure of the tensor 
product of modules to create a bifunctor (g> on an arbitrary category. The main result of 
[ML63] says, essentially, that any two n-fold products that contain the same objects 
in the same order are naturally isomorphic via a unique canonical natural isomorphism. 
Interpreting the n-fold products as parsings of sentences and natural isomorphisms as weak 
equivalences between parsings, this result is akin to saying that monoidal categories do 
not contain any structural ambiguity. 

The investigation of coherence is certainly not limited to monoidal categories. Indeed, 
one may hope for a version of Mac Lane's theorem for many different types of covariant 
structures. A covariant structure on a category ^ consists of: 

• A collection of basic functors of the form c € n — ► % '. 

• A collection of equations between certain pairs of formally different terms built 
from the basic functors. 

• A collection of natural transformations between certain terms formed from the 
basic functors. 

• A collection of equations between pairs of formally different natural transforma- 
tions constructed via a sequence of compositions and substitutions of the basic 
natural transformations. These are typically called coherence axioms. 

One of the most basic covariant structures is that of a coherently associative bifunctor 
(g>. This structure consists of a natural isomorphism a : a <S> (b ® c) — ► (a (g> b) <S> c together 
with a coherence axiom stipulating that the following diagram commutes: 
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a® (6® (c® d))) 





(a ® 6) ® (c ® d) 



((a ® 6) ® c) ® d 



a® ((6® c) ® d) 



(a® (6® c)) ® d 



A special case of Mac Lane's coherence theorem for monoidal categories states that any 
other diagram constructed from a, ® and the identity natural isomorphisms commutes by 
virtue of the commutativity of the above diagram. 

In endeavouring to construct an analogous coherence theorem for an arbitrary covariant 
structure carried by a category, one may ask two related questions: 

(1) Is a given covariant structure coherent? 

(2) What coherence axioms are required in order to make a given covariant structure 
coherent? 

The main goal of this thesis is to tackle the above questions in the greatest possible 
generality as well as to develop applications of the resulting coherence theorems. In the 
following section, we give a more complete outline. 



Outline 

Chapter 2: The chapter starts by developing a definition of rewriting 2-theories. These 
form the main framework for our investigations and the chapter describes the free algebra 
generated by a rewriting 2-theory before showing that a rewriting 2-theory defines a Law- 
vere 2-theory and, hence, a covariant structure. After briefly discussing relations to other 
existing systems, the coherence problem is rigorously defined within the context of rewrit- 
ing 2-theories. Categorifications are introduced as a method for weakening an equational 
variety into a categorical structure and it is shown that a coherent categorification of an 
equational variety defines an equivalent categorical structure to the variety. Finally, some 
useful general tools for working with rewriting 2-theories are introduced. 

Chapter 3: Dehornoy [Deh93] introduced structure monoids as algebraic invariants of 
equational varieties. The main result of the chapter shows how to construct a presentation 
of the structure monoid of an equational variety £ from a coherent categorification of £. 
In certain situations, the structure monoid forms a group in a natural way and the result 
is extended to this setting. 

Chapter 4: The main direction of this chapter is to generalise Mac Lane's proof of co- 
herence for monoidal categories to rewriting 2-theories that are confluent and terminating. 
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"Terminating" means that there are no infinite chains of non-identity morphisms, while 
"confluence" is the property that every span may be completed into a square, as in the 
following diagram: 

I 
l 

y 

. . 

Subsequently, a general coherence theorem is developed for rewriting 2-theories describing 
invertible covariant structures, which directly generalises the situation of monoidal cate- 
gories. 

Chapter 5: This chapter develops a surprising application of the results of Chapter 4. 
Dehornoy [Deh05] has previously shown that Thompson's group F is the structure group 
of the variety of semigroups and that Thompson's group V is the structure group of the va- 
riety of commutative semigroups. Dehornoy also constructed presentations of these groups 
using Mac Lane's coherence axioms for the associated categorifications. In light of the re- 
sults of Chapter 3, these presentations are not too surprising. Indeed, the work in Chapter 
3 was directly motivated by these results. Chapter 5 begins by constructing varieties for 
higher-order associativity and higher-order associativity and commutativity. It is shown 
that the structure groups for these are the higher Thompson groups F n i and the Higman- 
Thompson groups G n ,i) respectively. The chapter goes on to construct categorifications of 
these varieties and thereby to obtain new presentations of F n ^ and G n ^. The coherence 
axioms for the categorifications directly generalise Mac Lane's axioms for the binary case, 
although a new class of coherence axioms is required in the higher-order case that are not 
present in the binary situation. 

Chapter 6: It is not the case that every coherent rewriting 2-theory is terminating and 
confluent. This chapter develops general coherence theorems for rewriting 2-theories that 
are not confluent and only weakly terminating, in a precise sense. The techniques are 
radically different from those of Chapter 4. The driving philosophy is that a parallel 
pair of morphisms are equal if and only if they admit a subdivision, each face of which 
commutes. As such, the approach is primarily through topological graph theory, where a 
subdivision is defined as a certain ambient-isotopy class of planar graph embeddings whose 
boundary consists of the parallel pair of maps under investigation. The resulting coher- 
ence theorem is also used to construct examples of finitely presented rewriting 2-theories 
that cannot be made coherent via only finitely many coherence axioms, but are otherwise 
well behaved. The related coherence problem of when there exists a decision procedure for 
the commutativity of diagrams arising from a rewriting 2-theory is also briefly investigated. 

Chapter 7: Iterated monoidal categories [BFSV03] arose as a categorical model of iter- 
ated loop spaces. As a rewriting 2-theory, they are particularly interesting because they 
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possess a nontrivial equational theory on both objects and morphisms, as well as being 
non-confluent. A highly technical proof that iterated monoidal categories are coherent is 
given in [BFSV03]. After introducing iterated monoidal categories, this chapter goes on 
to exploit the results of Chapter 6 in order to obtain a new, conceptually straightforward 
proof of coherence. 

The inter-dependence between chapters is indicated in the following Hasse diagram: 



2 




3 



4 




5 



6 



7 



Throughout this thesis, we read / • g as "/ followed by g n . 



CHAPTER 2 



Rewriting 2-theories 



Our main goal in this chapter is to define the class of two-dimensional algebraic struc- 
tures that form the basis of the following chapters. The definition that we develop uses 
a base set of variables. This is in contradistinction with the standard approach to two- 
dimensional universal algebra, which prefers a variable-free approach via categorical con- 
structions. The reason for choosing to work with variables is rather utilitarian: it retains 
a strong link to first-order term rewriting theory and, therefore, preserves the strong link 
with various computational and linguistic constructions. A more pragmatic reason for our 
definition in terms of variables is that it is precisely what allows us to bring various com- 
putational and combinatorial techniques to bear on otherwise categorical constructions. 
The choice of working with variables has two technical implications. First, it makes the 
transition from a presentation of a theory to a concrete algebraic structure on an arbi- 
trary category slightly more difficult than it otherwise might be. Second, it does not allow 
us to distinguish between certain different categorical structures. For instance, the map 
i : A® A ^ A® A gives rise to two different possible semantic interpretations: a map 
that preserves the order of the factors and one that reverses the order of the factors. Our 
construction blurs the distinction between these two semantic interpretations; indeed, ei- 
ther choice would provide an adequate semantics for the map. It is important to note that 
the claim being made here is that the two maps arise purely from two different semantic 
interpretations and that there is, a priori, no syntactic way in which to distinguish two 
interpretations, no More fundamentally, the combinatorial properties of the categorical 
structure are unaffected by the particular semantic interpretation of the maps. Indeed, we 
shall see in this chapter that all of the different choices of semantic interpretations yield 
isomorphic structures. Before jumping into the world of two-dimensional algebra, we seek 
some intuition from classical one-dimensional algebra. 

When developing a classical definition of equational varieties, one starts with a graded 
set of function symbols T and imposes a collection of equations, £, on the absolutely free 
term algebra generated by T on some set of variables X, which we denote by Ff(X). 
Quotienting out by the smallest congruence generated by £ on Fjr(X) yields the free 
(J 7 | 1 £')-algebra on X, which we denote by ¥^^(X). It is at this point that we run into a 
conceptual problem: the set of variables X holds a privileged position in the construction. 
If we wish to obtain the free (,F|£)-algebra on some other set Y then we run into a problem 
before we even start — the very concept of an (.F|£")-algebra was defined with the aid of 
XI The traditional way around this problem is to define an (jF|£)-algebra to be an algebra 
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A of type T such that for any equation (s, t) £ £ and any homomorphism p : Fjr(X) — ► A, 
we have p(s) = p(t) [BS81]. 

The viewpoint of algebras as being induced by homomorphisms from some particular 
free algebra is the starting point of Lawvere theories [Law04]. Here, we consider a function 
symbol of arity n to be a function n — ► 1 and use the Cartesian structure of Set in order to 
permute, duplicate and delete variables as we please. This allows us to replace equations 
with commutative diagrams and yields the category Th((.F|£)). This category has finite 
products; indeed, its objects are just the natural numbers, where a number n is considered 
to be the n-fold cartesian product of 1. The category of finite product preserving functors 
Th((jF|£)) — > Set forms the analogue of algebras qua homomorphisms in the classical case, 
allowing us to transfer the structure inherent in (J~\£) to an arbitrary set. 

Our basic strategy in this chapter is to replicate the above arguments in the two- 
dimensional setting in order to provide an abstract framework for categories with alge- 
braic structure definable in a variable-based manner. Our essential objects of study are 
rewriting 2-theories, which consist of a first order term rewriting system modulo a two 
dimensional congruence. This retains a strong link with computational structures. In- 
deed, syntactically, rewriting 2-theories can be seen as a generalisation of unconditional 
rewriting logic, which arose primarily in the study of concurrent systems [Mes92]. The 
syntax and algebraic semantics of rewriting 2-theories is covered in sections 2.1 and 2.2, 
respectively. Connections with rewriting logic and other systems are briefly outlined in 
Section 2.3. 

The fundamental focus of this thesis is coherence for rewriting 2-theories and we intro- 
duce this concept formally in Section 2.4. Subsequently, we explore the relationship be- 
tween equational varieties and coherent rewriting 2-theories in Section 2.5 before sketching 
some basic results in Section 2.6 that will be of frequent use. 

2.1. Syntax 

The purpose of this section is to introduce a general class of term rewriting systems 
whose semantics correspond to categories with an additional covariant structure. Con- 
cretely, we work with a term rewriting theory modulo a two-dimensional congruence. That 
is, a term rewriting system equipped with an equational theory on terms and an equational 
theory on reductions, together with an associated calculus of proof terms. 

Syntactically, we shall be working with structures of the form (J 7 ; T \ Ejr • Sr), where 
J- is a set of function symbols, T is a set of reduction (or transformation) rules, is an 
equational theory on T and St is an equational theory on T containing a certain basic 
congruence. Our main task in this section is to describe the structure that this data 
generates, which forms our two-dimensional analogue of Wi j:\s\- We begin by building the 
one-dimensional aspect of the structure. 

Definition 2.1.1 (Term Algebra). Given a graded set of function symbols T := J2 n 
and a set X, the absolutely free term algebra generated by T on X is denoted by ¥j^(X). 
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The next layer of structure adds an equational theory to Fjf(X): 

Definition 2.1.2. Given a graded set of function symbols T, a set X and a set of 
equations £?r on ¥j^(X), we denote by F/jti^v (X) the quotient ofFp(X) by the smallest 
congruence generated by £?. We write [t] for the image of a term t under the canonical 
homomorphism Fjr(X) — > F/^i^^X). 

We can now begin to describe a two-dimensional term rewriting theory. Our first step 
is to define a labelled term rewriting theory. 

Definition 2.1.3 (Labelled term rewriting theory). A labelled term rewriting theory 
is a structure {J-; C; T \ £p)x, where J- is a graded set of function symbols, X is a set of 
variables, £? is a system of Fjf(X)- equations, C is a set of labels and T is a subset of 
C x (F^|£^(X)) 2 satisfying the following consistency conditions: 

If (a, si, ti) and (a, S2, £2) o,re in T then s\ = S2 and t\ =t2- 
If (a,s,t) € T, we write a : s — > t. A member of T is called a labelled reduction rule. 

Given a labelled term rewriting theory {J 7 ; £; T \ £p)x, the particular choice of C is 
irrelevant. What is important is simply that there are sufficiently many labels for the 
number of reduction rules. Accordingly, we shall henceforth suppress explicit mention of 
the labels and write (JF; T \ £p)x for a labelled term rewriting theory. For the remain- 
der of this thesis, we fix an arbitrary countable infinite set X and write (J 7 ; T \ £?) for 
(J-; T I £t)x when the particular choice of variable set is unimportant. A labelled term 
rewriting theory embodies the basic reductions that are to generate all others. The next 
step is to obtain an analogue of the absolutely free term algebra for this higher dimensional 
layer of structure. This is achieved by the following definition, where the notation x" 1 is 
an abbreviation for x\, . . . , x n and F{s n /x n ) denotes the uniform substitution of the free 
variables x n by s n . 

Definition 2.1.4. Given a labelled term rewriting theory S£ := ("F;7 \£t)x, the set 
of reductions generated by Jzf is denoted ¥&{X) and is constructed inductively by the 
following rules: 

l s : [s] -> [s] (Identity) 
ipi : [si] -»■ [h] . . . 



F(<Pl, ■ ■ ■ , <Pn) ■ [F(S1, S n )] -» [F(t U t n )\ 

t : [F(x n )] -> [G{x n )\ (jpj : [ Si ] -> [t t ])f =1 
T(^ 1 ,...,ip n ):[F(s n /x n )]^[G(t n /x n )] 

<p:[s}^ H il> : M -> [t] 



(Structure) 
(Replacement) 
(Transitivity) 



In the (Identity) rule, [s] E ¥^£ r ^(X). In the (Structure) rule, F is a function symbol of 
rank n. In the (Replacement) rule r is a reduction rule of rank n. When the particular 
choice of X is irrelevant, we write F(Jzf) /orF_^(A). 
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Example 2.1.5. Let be the labelled rewriting theory consisting of a single binary 
function symbol <8>, an empty equational theory on terms and the single reduction rule: 

a(h,t 2 , t 3 ) : h <g) (t 2 <8) i 3 ) -> (*i ® h) <S> i 3 - 

^4 derivation of 

[A®(B®(C®D))] -> [(A®B)®(C , ®£))] 

in F(jSf) zs given fry: 

lo : [C] - [C] Id : [£»] -» [D] 

1 A ■ [A] [A] l s : [B] - [B] lo®lr> = [C®£>] - [C®D] 

a(l A ,l s ,l c ®l D ) ■ [A®(B®(C®D))] -> [(A®B)®(C®D)] 

The consistency condition in Definition 2.1.3 easily yields the following lemma, which 
asserts that we may equate reductions with their labels, thus providing a term calculus for 
the reductions. 

Lemma 2.1.6. Let be a labelled term rewriting theory. If a : s t and a : s' — > t' 
are in F(jSf), then s = s' and t = t' . □ 

At this point, we have in hand a notion of a labelled rewriting theory, which corresponds 
to the usual abstract setting of rewriting modulo an equational theory on terms. We now 
proceed to add an equational theory on reductions to this framework. This allows us 
to consider problems relating to equivalences of reductions in general rewriting systems. 
We impose two restrictions on this structure. The first is that we may only set two 
reductions to be equal if they have common sources and targets since, in applications, we 
very rarely have a sound ontological basis for equating arbitrary reductions. The second 
is that we enforce the presence of certain equations that equate reductions differing only 
in the order of rewriting nested and/or disjoint subterms. As we shall see in the following 
section, this is precisely what is needed in order to ensure a sound categorical semantics. 
The computational effect is to equate orthogonal reductions — those that do not rewrite a 
critical pair. This congruence is usually dubbed the "permutation congruence" in the term 
rewriting literature [vOdV03]. The permutation congruence is also known as "causal 
equivalence" and the congruence classes that it generates correspond to the notion of 
Mazurkiewicz traces arising in concurrency theory. The following definition states these 
concepts more formally. 

Definition 2.1.7 (Rewriting 2-Theory). A Rewriting 2-Theory is a tuple TZ := (J 7 ; T\£f, £r), 
where (J-; T \ £jr) is a labelled term rewriting theory and £q- is a set of equations on 
¥({J-;T \ £p)) satisfying the following consistency condition: 

If (tfi,ip 2 ) € £t and ipi : [si] — > [h] and ip 2 : [s 2 ] — > [t 2 ], then [si] = [s 2 ] and [ti] = [t 2 ]. 

We further stipulate that the following equations are satisfied. We refer to these equations 
collectively as the standard congruence and denote them by S(7Z) 
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l s ■ W = V (ID 1) 

<p-h = <P (ID 2) 

ip ■ (ip ■ p) = (if ■ ip) • p (Assoc) 

F(tp 1 ,...,tp n ) ■ F(lp U • • • ,Y>n) = ^(<£>1 •V'l,---,V 3 n-V , n) (Funct) 

. . . ,y? n ) = s(</?i,...,<p n ) -^(Iti, •••,!*„) ( Nat 1) 

tp((pi, . . .,ip n ) = <p(l Sl ,.. ■ ,l Sn ) • i(<£>i, . . . ,^n) (Nat 2) 

In the above, F G ^ and ip,ip, p,(pi, . . . , ip n ,tpi, . . . ,ip n are reductions in F(1Z) such that 
the above compositions are well defined. 



One of the benefits of allowing additional equations on reductions beyond that provided 
by the standard congruence is that it allows us to study invertible reduction rules, which 
arise when we recast an equational theory as a rewriting system. Moreover, it provides 
enough flexibility for us to be able to place equations on non-invertible reduction rules, 
which model phenomena such as non-reversible computations. 



Definition 2.1.8 (Invertible). Given a rewriting 2-theory (JF;T | £jr ; £ r ) } a reduction 
rule ip : [s] — > [t] in T is invertible if there is a reduction rule ip : [t] — > [s] in T and a 
variable substitution a : X — > X such that the equations tp a ■ ip a = l s <r and ip a ■ (p a = If 
are both in£r- A rewriting 2-theory is invertible if all of its reduction rules are invertible. 
We say that ip is an inverse of if. 



In defining particular rewriting 2-theories, we shall often just say that a reduction 
is invertible, without explicitly giving the data for its inverse. That is, if we say that a 
rewriting 2-theory contains an invertible reduction rule p, we mean that it also contains the 
inverse p~ l together with the necessary equations. Before proceeding, we give an example 
of a rewriting 2-theory. 



Example 2.1.9. This example gives a presentation of an invertible rewriting 2-theory 
involving associativity and unit reduction rules. We shall see in Section 2.2 that this 
example gives a presentation of the free monoidal category on a discrete category. 

The theory consists of a binary function symbol ® and a nullary function symbol I. 
We write £3 in infix notation. It has the following invertible reduction rules: 

a{ti,t 2 ,t 3 ) : ti®(t 2 ®t 3 ) -^(ti®t 2 )®t 3 
A(t) : I®t^t 
p(t) : t®I -^t 



2.2. SEMANTICS 

It has equations stating that the following diagrams commute. 

a <8 (&<8 (c<8 d))) a (g> 6 
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1®A 





(a ® 6) ® (c ® d) 



((a (g) b) <8 c) (8) d 



a® ((6®c) <8d) 



(a<8 (6<8>c)) <8d 




□ 



Definition 2.1.10. // ft := (J* 7 ; T | E T ; £ r ) is a rewriting 2-theory, then [£ T + S(K)] 
denotes the smallest congruence generated by £t and S(R) on ¥(IZ). It is generated in- 
ductively by the following rules: 

(Identity) (p € T 

(Inheritance) (ipi, <p2) € £t + S(1Z) 
(Symmetry) 





= V 




= ¥2 




= 4, 



1p = lfi 

(fl=1pl ... ip n =lpn 
F(ip 1 ,...,if n ) = F(lpi, . . . ,1p n ) 

(fl=1pl ... tp n = ^n 
T(lf 1 ,...,if n ) =r(V'l,...,V'n) 

y = j> 

(if! =ipi) : s -> u {(p 2 = i)2) u ^ t 



(Structure) F £ T n 

(Replacement) t £ T n 

(Substitution) a a substitution. 
(Transitivity) 



(991 • Vi = ^2 • V'2) : s -»■ * 
All that remains is to quotient out by the congruence generated by an equational theory 
on reductions. 

Definition 2.1.11. Given a rewriting 2-theory TZ := {F;T\£j:; £r)x, weuse¥^. T ^ £:F . £T < ) (X) 
to denote the quotient ¥ ^.^-^^(X) /[£q- + SiJZ)]. Where explicit mention of the set X is 
not necessary, we write ¥{TVj for ¥n(X). 

In the following section, we investigate the semantics of rewriting 2-theories and es- 
tablish that a rewriting 2-theory provides a presentation of a free structure carried by a 
discrete category. 



2.2. Semantics 



In this section, we shall provide a semantics for rewriting 2-theories akin to the se- 
mantics that Lawvere theories provide for syntactically defined equational varieties. The 
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appropriate generalisation of Lawvere theories to this setting is a special case of discrete 
enriched Lawvere theories — algebraic theories on categories whose hom-sets carry ad- 
ditional structure [Pow99, Pow05]. The presense of the standard congruence on the 
set of reductions is precisely what puts us in the 2-categorical setting. Had we omitted 
the requirement that Et contains the standard congruence, then we would instead be in 
the more general setting of sesquicategories, whose relationship with term rewriting was 
investigated by Stell [Ste94]. As we are in the 2-categorical setting, the hom-sets are 
themselves categories. We shall not require any deep enriched category theory but shall 
make some use of the language of 2-dimensional categories, an introduction to which may 
be found in [KS74]. 

Definition 2.2.1 (Lawvere 2-theory). A discrete finitary Lawvere 2-theory is a small 
2-category Jzf with finite 2-products, together with a finite-2-product preserving identity- 
on-objects 2-functor i : Nat op — > Jzf, where Nat is the 2-category of natural numbers and 
all maps between them. A map of discrete finitary Lawvere 2-theories =Sf — ► Jzf' is a 
finite-product preserving 2-functor 6 making the following diagram commute: 

Jzf Jzf' 
Nat op 

Since we shall not require any more sophisticated notion of Lawvere 2-theory, we use 
"Lawvere 2-theory" to mean "discrete finitary Lawvere 2-theory" . These are an alternative 
categorical presentation of strongly finitary 2-monads on Cat, studied in [KL93]. The 
way in which to visualise a Lawvere 2-theory is to think of each object as £F ra for some 
arbitrary category £F (although, strictly speaking, the objects are simply natural numbers). 
The arrows are then maps £t n — > £F m and the two-cells are maps between such arrows. 
For us, all of the arrows of Jzf will be generated by basic arrows *2) n — > corresponding 
to function symbols, and all of the two-cells will be generated by reduction rules. 

A Lawvere 2-theory is essentially a two-dimensional analogue of a free algebra. As 
in the one-dimensional case, we define a category having the structure specified by Jzf by 
product-preserving functors out of Jzf. 

Definition 2.2.2. A model of a Lawvere 2-theory Jzf in Cat is a finite-product pre- 
serving 2-functor M : Jzf — > Cat. 

In order to relate rewriting 2-theories with Lawvere 2-theories, we need to show how to 
generate a Lawvere 2-theory Th(7£) from a given rewriting 2-theory TZ. This would allow 
us to translate the purely syntactic TZ into an object that specifies an additional structure 
on a category. 

In general, there is not a strictly unique way in which to construct Th(JZ), since there 
may be many possible ways in which to express a given reduction rule, particularly in the 
case where TZ contains reduction rules such as A® A — > A. However, as we shall see, Th(7£) 
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is unique up to 2-isomorphism of Lawvere 2-theories, so the distinction is inessential for 
our purposes. 

Definition 2.2.3. Let TZ := LT\T\£t; £t) be a rewriting 2-theory. A Lawvere 2- 
theory associated tolZ is a Lawvere 2-theory Jzf containing precisely the following structure: 

(1) For every term t € F(7Z) of arity n, there is a one-cell \t\ : n — > 1 in Jzf . 

(2) For every reduction p : [s] — > [t] in ¥(TZ), there is a 2- cell \p\ : \s\ — > \t\ in Jzf. 

(3) s = t if and only if \s\ = \t\, for terms s,t £ F(7£). 

(4) a = t if and only if \a\ = \r\, for reductions a,T € F(TZ). 

It is immediate from the definition that any two Lawvere 2-theories associated to 
a rewriting 2-theory differ only in the precise way in which the function symbols and 
reduction rules are represented. This immediately implies the following lemma. 

Lemma 2.2.4. Any two Lawvere 2-theories associated to a rewriting 2-theory TZ are 
2-isomorphic. □ 

In light of the previous lemma, the following is well-defined: 

Definition 2.2.5. Th(7£) is the Lawvere 2-theory associated to the the rewriting 2- 
theory TZ. 

As it stands, the relationship between Th(7£) and TZ is still quite vague. In the re- 
mainder of this section, we shall see how to construct Th(TZ) from TZ and we shall also see 
that no "extra" equations arise from the 2-categorical nature of Th(TZ). 

Let TZ := {T\F\£^\ £r) be a rewriting 2-theory and let Jzf be the initial Lawvere 
2-theory. That is, Jzf contains no structure other than that implied by the existence of 
a finite-product preserving identity-on-objects functor i : Nat op — > Jzf. For each function 
symbol F <G T of arity n, add a one-cell |F| : n — > 1 to Jzf. Extend this inductively to 
terms by setting: 



For each equation (s, t) £ £p, we enforce an equality |s| = \t\ by making use of the cartesian 
structure of Jzf. In particular, we may make use of the following operations: 

• We may duplicate an object by making use of the diagonal map A : 1 — > 2. 

• We may delete the left hand-side of a pair of variables by making use of the first 
projection wi : 2 — > 1. 

• We may delete the right hand-side of a pair of variables by making use of the 
second projection -K2 : 2 — > 1. 

• We may commute two variables by making use of the twist map r : 2 — > 2. 

As an example, suppose that TZ contains the binary function symbol <8> and the equation 




Sn) 



®(a,®(b,c)) = ®(®{b,b),a). 
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This equation can be represented by saying that the following diagram commutes: 



2 »- 1 



1X7T1 



Axl 



— 3 



5x1 



— 2 



We may interpret the above diagram as saying that the following two deductions are 
equal: 



a, b, c 



1 



a, <g>(6, c) 
5(a,<g>0, c)) 



a, b, c 

a, b 

b, a 

b, b, a 



1 X 7Tl 



r 



A x 1 



(8) (6, b), a 
(8>((g>(6,6),a) 



x 1 



Of course, there are other ways in which to represent the equation. However, any 
choice of diagram to represent the equation induces the same congruence on one-cells. 

Next, we need to construct a two-cell |p| in S£ for every reduction p G F(1Z). We 
accomplish this by constructing a two-cell \p\ : \s\ — > |t| for every reduction p : [s] — > [t] in 
T and extending the construction inductively to arbitrary reductions as follows: 

IP'KKI, • • • , Wn\) if P = pVl) • • • > a n) 

\p\ = \ \pi\ ■ \P2\ if P = Pi - pi 

JpI ifpe^ 

As in the construction of a congruence in Jz? from f^-, there is a choice as to how to 
construct \p\ for a given p G T. However, in light of Lemma 2.2.4, this particular choice is 
inconsequential. Finally, we enforce the equation |er| = |r| for every (a, r) G <Sr- 

From our construction of Th(TZ), we have that any equation that holds in F(1Z) holds, 
after suitable translation, in Th(7£). The converse result holds but is not immediately 
obvious. That is, it is not clear that the fact that Jz? is a 2-category does not introduce 
any extra equations. 

Since we are only interested in models of in Cat, checking that all of the 2-categorical 
axioms are satisfied in ¥(TZ) amounts to checking the axioms for functoriality and natu- 
rality. 

Let r : s — > t be a reduction rule of rank n. Naturality of r amounts to the assertion 
that for all reductions <Ji : s« — > U, we have: 

T ( 1 s 1 ,---As n ) -t(ai,...,a n ) = s(a!,...,a n ) • r(l tl , . . . , l t „). 

This follows immediately from the combination of (Natl) and (Nat2). 
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Suppose that F £ T n . Without loss of generality, we may assume that F is binary. 
The functoriality of F is established as follows: 

F(<p, 1) • F(l, = F{<p ■ 1, 1 ■ V) by (Funct) 

F(ip,ib) by (ID1) and (ID2) 

= F(l • (p, V> ■ 1) by (ID1) and (ID2) 

= F(1,Y>)-F(^1) by (Funct) 

Our construction of Th(TZ) from TZ carries the message that we may view function 
symbols in TZ as functors and reduction rules in TZ as natural transformations. Thus, a 
rewriting 2-theory can be seen as giving a syntactic specification of an additional structure 
carried by a category. 

Example 2.2.6. Let TZ be the rewriting 2-theory from Example 2.1.9. Then, Th(TZ) is 
the Lawvere 2-theory for monoidal categories. 

In the following section, we discuss several systems related to rewriting 2-theories. 

2.3. Relation to other systems 

Our basic structure of a rewriting 2-theory, TZ := (J-; T \ ; £t), simultaneously gen- 
eralises several other systems, which we cover in order of increasing generality. 

(1) First order rewriting: If both Ej: and Eq- are empty and we do not impose 
the standard congruence, then we are in the setting of first order term rewriting. 
However, our construction of ¥{TZ) adds in identity reductions, which are not 
usually assumed to be present in term rewriting systems. 

(2) Rewriting modulo an equational theory: If £q- is empty and we do not 
impose the standard congruence, then we are in the setting of rewriting modulo an 
equational theory, with the same caveat as for standard first order term rewriting. 

(3) Calculus of Structures: If Eq- is empty and we do not impose the standard 
congruence, then we are also in the setting of the Calculus of Structures [Gug07, 
GS01]. This is a proof theoretic framework that extends one sided Gentzen 
systems with the ability for inference rules to act arbitrarily deeply within a 
sequent. 

(4) Rewriting logic: If Er is empty, then we are in the setting of unconditional 
rewriting logic [Mes92]. This system has its roots in concurrency theory and 
particularly in the notions of causal equivalence and Mazurkiewicz trace lan- 
guages. 

(5) Clubs: The notion of a fully covariant club was introduced by Kelly [Kel72] as a 
unified framework for covariant structures carried by a category. His description 
of fully covariant clubs is very similar to our notion of a rewriting 2-theory, with 
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several points of difference. First, Kelly's calculus of proof terms is provided im- 
plicitly by the categorical setting, whereas our calculus is generated inductively. 
Second, Kelly works purely within the framework of two-dimensional category 
theory, whereas we prefer an approach via term rewriting systems, which high- 
lights the connection with computational notions. A more substantial technical 
point of differentiation is that Kelly gives a variable-free presentation, which does 
not allow the expression of certain equations at the term level, such as the com- 
mutativity of a binary function symbol. Indeed, the only equations expressible at 
the term level in Kelly's setting are the strongly regular ones — those equations 
(s,t) where Var(s) = Var(i), each variable appears precisely once in both s and t 
and the order in which the variables appear in s is the same as the order in which 
they appear in t. 

In the following section, we introduce the coherence problem, which will be our main 
focus throughout the thesis. 

2.4. Coherence 

There are many interrelated problems that go under the name of "coherence". Ul- 
timately, all of these questions relate to describing the free algebra generated by some 
algebraic structure on a category. The original manifestation of this problem was in Mac 
Lane's investigation of monoidal categories [ML63]. Since all diagrams commute in the 
free monoidal category on a discrete category, it was this phenomenon that was originally 
associated with the term "coherence" . This was in keeping with work in algebraic topology 
on defining algebraic operations on topological spaces together with equations that hold 
only up to homotopy [Sta63]. 

It is not the case that all algebraically defined structures on categories enjoy the same 
strong coherence property that monoidal categories do. For instance, simply removing one 
of the coherence axioms from the definition of a monoidal category destroys this property. 
This observation led to Kelly reformulating the coherence problem to ask which diagrams 
commute purely as a result of the axioms [Kel72]. However, even this question may be 
too strong, for we may not be able to even decide if a given diagram commutes as a result 
of the axioms. The view that a coherence problem is essentially concerned with deciding 
whether given diagrams commute has its roots in Lambek's investigation of residuated 
structures arising in mathematical linguistics [Lam68] . 

The main thrust of this thesis is the investigation of various coherence problems for 
structures defined by rewriting 2-theories. While this does not cover the complete array of 
possible categorical structures, it is sufficiently broad so as to encompass many interesting 
and pathological examples. In this section, we set out precise definitions of the various 
coherence problems. 

One difficulty that arises when investigating coherence problems is that the commu- 
tativity of a particular diagram may have no bearing on the question at hand. For this 
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reason, we need to carefully define those diagrams and reductions that are of importance 
for us. These are the diagrams that are in "general position". That is, they contain the 
maximum number of distinct variables. Before making this precise, we need the concept 
of the shape of a reduction. 

Definition 2.4.1. Let TZ be a rewriting 2-theory. The Shape of a reduction a € ¥(TZ) 
is defined recursively by the following: 

Shape(ai) • Shaped) if a = ct\ ■ ai 

r(Shape(ai), . . . , Shape(a n )) if a = r(a±, . . . , a n ) 

i ? (Shape(ai), . . . , Shape(a„)) if a = F(a\, . . . , a n ) 

o otherwise 



Shape(a) 



In the system from Example 2.1.5, we have: 

Shape(a(l j4 , Ib, 1c)) = Shape(a(lA, 1a, 1a)) = «(°, °, °) 

We now need a precise definition of the variables present in a reduction. 

Definition 2.4.2. Given a rewriting 2-theory TZ, the set of variables in a reduction 
a € F(1Z) is defined recursively as follows: 

Var(ai) U Var(a2) if a = a\ ■ 

Ur=i Var («j) ifa = T(a 1 ,...,a n ) 

Ur=iVar(aj) if a = F(ai, . . . , a n ) 

a otherwise 



Var(a) 



Returning to Example 2.1.5, we find that 

Var(a(U, Ib, 1c)) = Ua, ±b, 1c*}, 

whereas 

Var(«(U,lA,lA)) = {lA}- 
We can finally nail down what we mean when we say a reduction has the maximum possible 
number of variables. 

Definition 2.4.3. Given a rewriting 2-theory TZ, a reduction a G ¥(7Z) is in general 
position if 

|Var(a)| = max{|Var(r)| : r G ¥(7Z) and Shape(r) = Shape(a)}. 

Example 2.4.4. Consider the system from Example 2.1.5 augmented with the following 
reduction rule: 

P(x) : x x — > x 

Then, 

a(l A , 1a, Ib) ■ (/?(U) ® Ib) : A ® (A ® B) -> A ® B 
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is in general position, whereas 

a(lA,U,ls) : A® (A®B) -> {A® A) ® B 
is not in general position. 

For coherence problems, we only need to focus on those diagrams whose reductions 
are all in general position. This allows us to define the various problems that will be our 
focus. 

Definition 2.4.5. Let 1Z := (F;T\£jr; £r) be a rewriting 2-theory. 

(1) 1Z is Mac Lane coherent if any two parallel reductions in general position in F(7Z) 
are equal. 

(2) 71 is Lambek coherent if there is a decision procedure for the commutativity of 
diagrams in general position in ¥(1Z). 

Unfortunately, deciding whether a given 2-theory is coherent in either the Mac Lane 
or Lambek sense is often impossible. 

Theorem 2.4.6. The decision problems for Mac Lane coherence and Lambek coherence 
are undecidable over the class of finitely presented rewriting 2-theories. 

PROOF. Following the work of Markov [Mar51], we know that many problems are 
undecidable for finitely presented monoids. Our basic strategy is to show how to encode a 
monoid as a rewriting 2-theory. Let M := (X\R) be a finite presentation for a monoid. Let 
1Z(M) be the rewriting 2-theory consisting of a single unary function symbol F, reductions 
Tj : F(x) — > F(x) for every n £ X and relations (ui,cjj) for every (uji,ujj) £ R. If we 
could solve the Mac Lane coherence problems for 7Z(M), then we could decide whether M 
is trivial. Similarly, if we could solve the Lambek coherence problem for 1Z(M), then we 
could solve the word problem for M. Since both of these monoid problems are undecidable 
in general, so too are the associated coherence problems. □ 

The notions of coherence that we have introduced here are focused entirely on the 
congruence present on reductions. Historically, this arose because equations on terms can 
often be converted into coherent natural isomorphisms. We explore this phenomenon in 
the following section. 

2.5. Categorification 

The fundamental group of a topological space is usually defined as the group of 
homotopy-equivalence classes of based loops in the space. This definition forgets the 
particular relationships between any two loops lying in a given equivalence class. An 
alternative approach might be to define a group structure on the space of based loops to- 
gether with explicit homotopies between elements. This notion of algebraic structure "up 
to homotopy" was introduced in [Sta63] and has been extended to handle quite general 



2.5. CATEGORIFICATION 



19 



structures [Ros07]. A problem that arises with this approach is that one then needs to 
examine the relationships between the homotopies themselves. 

Translated into our language, the above process takes a labelled rewriting theory theory 
££ := (J 7 ; T \ £jr) and replaces it with a rewriting 2-theory 7£(Jz? ) in which each equation 
in £jr is replaced with an invertible reduction. In order to retain the link between the 
and 7£(Jzf), one needs to show that Th(Jz?) ~ Th(7?.(j£?)). However, this can only be the 
case if any two sequences of the new invertible reductions in 7£(Jz?) having the same source 
and target are equal. In other words, one needs to construct 7£(Jz?) in such a way that it 
is Mac Lane coherent. 

Definition 2.5.1. A categoriflcation of a labelled rewriting theory J*f := (T;T \ £p) is 
a rewriting 2-theory 7£(Jz?) := (J 7 ; T U I{£r) \ ®\ &Tul(£ T )), where: 

(1) I{£p) consists of reductions p Sy t ■ s — ► t and p~j : t — ► s for each (s, t) E £?. 

(2) £rui(£r) con t a i ns £t> as well as the equations 

Ps,t ■ Pjj = Is 
Ps,l ■ Ps,t = It 

for each (s, t) € £?. 
1Z(J?) is a coherent categoriflcation of Jz? if it is Mac Lane coherent. 

Example 2.5.2. Monoidal categories, as defined in Example 2.1.9 are a coherent cat- 
egoriflcation of the theory for strict monoidal categories. This theory consists of a binary 
function symbol ®, a nullary function symbol I as well as equations 

a ® (b <8) c) = (a <8) b) <g> c 
a® I = a 
I ® a = a 

If we take models for the theory of strict monoidal categories to be product preserving 
functors into Set, then we recover the variety of monoids. 

A categoriflcation of an equational theory £ := (J-\£^), is a categoriflcation of the 
labelled term rewriting theory (J 7 ; | Sjr). Similarly, we can define Th(Jzf) for a labelled 
rewriting «Sf := (J 7 ; T \ £ T ) to be Th((J 7 ; T | £>, 0)). The Lawvere 2-theory Th(£) associ- 
ated to an equational theory £ is defined analogously. 

Theorem 2.5.3. Let TZ(£) be a categoriflcation of the equational theory £. There is a 
biequivalence of 2- categories Th(lZ(£)) ~ Th(£) if and only iflZ(£) is coherent. 

Proof. Let £ be an equational theory and let 1Z(£ ) be a categoriflcation of £. 
Suppose that 7Z(£) is coherent. For each congruence class of 1-cells [s] £ Th(5), pick 
a distinguished element r([s]). Define a pseudofunctor F : Th(£) — > Th(TZ(£)) by: 

• O-Cells: Identity 
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• 1-Cells: F([s]) = r([s]) 

• 2-Cells: Th(£") contains only identity 2-cells. Define F(l^) = l r ([s])- 
Next, define a 2-functor G : Th(K(£)) -» Th(£) by: 

• O-Cells: Identity 

• 1-Cells: G{s) = [s] 

• 2-Cells: Since there is a 2-Cell s — > t in Th(7£(£)) precisely when [s] = [i] in 
Th(£), we can define G(p : s — > t) = lr s i. 

It follows from the definitions that F G = l-ivf)- Since 7£(£) is coherent, the two legs 
of the following diagram commute: 

G-F(s) s 
G-F(p) P 

G ■ F(t) t 

It follows that G • F = 1, so Th(TZ(£)) ~ Th(£). 

Conversely, suppose that Th(lZ(£)) ~ Th(£). Then, there exist functors F : Th(£) — > 
Th(7fc(£)) and G : Th(7£(£)) -> Th(£) such that F • G = 1. Suppose that p 1: p 2 : s -» t 
are a parallel pair of 2-cells in Th(7£(£)). Then, F • G{pi) = F(l [s] ) = F ■ G{p 2 ). Thus, 
TZ{£) is coherent. □ 

Example 2.5.4. It follows from Theorem 2.5.3 that the theory for monoidal categories 
is biequivalent to the theory for strict monoidal categories. 

Given two rewriting 2-theories Tlx and R 2 , we define Th(7£i)UTh(7£ 2 ) := Th(7liU^ 2 ). 

Corollary 2.5.5. Let 71 := (J 7 ; T \ £? \ £r) be a labelled rewriting theory and let 
{F;I{£f) \0;£j(p)) be a coherent categoriftcation of (T\£j:). Then 

Th(ft)~Th((^;TU/^|0,5 r U£ J( ^}). 
Proof. By Theorem 2.5.3, we have 

Th(ft) = Th((jT; | 5^; 0} U {0;T \ 0; £ T }) 

= Th((^;0|^;0))UTh((0;T|0;^ r )) 

~ Th((JT; I{£ T ) | 0; 5 J(F) » U Th«0; T | 0; £ r }) 

= Th((f;ru%)|0^TU%))). 

□ 

The above corollary roughly states that, for a given rewriting 2-theory, we can switch 
between an equational theory on terms and a coherent invertible theory on terms as we 
please. This ability is very useful in investigating coherent structures. In the following 
section, we introduce some other useful concepts for investigating coherence. 



2.6. BASIC PROPERTIES 



21 



2.6. Basic properties 



This section is predominantly intended as a collection of basic concepts and results 
that will prove useful throughout the thesis. 

Given a rewriting 2-theory 1Z, we shall frequently need to break up a reduction in F(7Z) 
into a composite of smaller reductions. Since all of the reductions in ¥(TZ) are generated by 
a set of reduction rules, this process must ultimately terminate. However, it is important 
that we have some understanding of the resulting normal forms. 

Definition 2.6.1 (Singular). Let 1Z := {F\T\£p\ £?) be a rewriting 2-theory. The 
set of singular reductions in ¥(TZ) is denoted Sing(7£) and is generated as follows: 

• If p £ T n and [ti], . . . , [t n ] are congruence classes of terms in¥(JZ), then p(lt 1; . . . , L n ) 
is singular. 

• If F G T n and p is a singular reduction and 1 < i < n, then 



Example 2.6.2. In the system from example 2.1.9, the reduction 

l a <g a(l b , l c , l d ) : a <g (b <g (c <g d)) —> a <g ((6 <g> c) ® d) 
is singular, whereas the reduction 

a(l a , a(l b , l c , l d ), l e ):a® ((b <g> (c (8) d)) (g> e) -> (a (g) ((6 (g) c) (g) d)) (g) e 
is no£ singular. 

Lemma 2.6.3. Let 1Z be a rewriting 2-theory. Every non-identity reduction in F(1Z) is 
equal to a composite of finitely many singular reductions. 

Proof. Let 1Z be a rewriting 2-theory and let p be a reduction in F(TZ). Define the 
rank of p to be 



We proceed by induction on R(p) to show that p is a composite of singular morphisms. 
If TZ(p) = 1, then p is singular. 

Suppose that R(p) > 1. Suppose that p = pi ■ P2, where neither p\ nor p2 is an 
identity reduction. Then by induction each of p\ and p2 is a composite of finitely many 
singular reductions. Suppose that p = o~(ti, . . . , r n ), where a : s — > t and Tj : Sj — > tj are 
reductions in F(7?.) such that at least one Tj is not an identity map. Then, by (Nat 1) 
in Definition 2.1.7, we may rewrite p as s(ti, . . . , r n ) • (7(1^, . . . , lt n ). Since (7(1^ , . . . , L n ) 




is singular. 



R{p) = { 



'R{ Pl ) + R{p 2 ) Up = p 1 -p 2 

E7=iR(n) if P = F(r 1 ,..., 

Y%=i R ( T i) If p = o-(n,..., 

k l IfpGSing(7e) 
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is singular by induction, we may assume that p = F(t±, . . . , r n ), where Tj : [s,] — ► [tj]. 
Without loss of generality, suppose that n = 2. It follows from the functoriality of .F 
that p = F(t±, 1 S2 ) • F(l tl ,T2). By induction, each of ri and T2 is a composite of singular 
reductions. It follows then from the functoriality of F that p is equal to a composite of 
i2(p)-many singular reductions. □ 

In light of the above lemma, we know that any particular reduction is equal to a 
composite of only finitely many singular reductions. However, ¥(1Z) might still contain an 
infinite sequence of composable reductions. 

Definition 2.6.4 (Terminating). A rewriting 2-theory 1Z is terminating if any infi- 
nite sequence of composable singular reductions in F(1Z) contains cofinitely many identity 
reductions. 

Of particular importance in many investigations of various kinds of term rewriting 
systems are those terms that are not the source of any non-identity reduction. Often, one 
would like to assign such a term to an arbitrary term. 

Definition 2.6.5 (Normal Form). Let TZ be a rewriting 2-theory and let [s] be a term 
in F(7Z). A normal form for [s] is a term [t] such that there is a reduction [s] — > [t] in 
F(1Z) and there are no non-identity reductions whose source is [t] in F(1Z). We say that 
1Z has normal forms if every term in TZ has a normal form. 

In an arbitrary rewriting 2-theory TZ, a given term may or may not have a normal 
form. If TZ is terminating, then every term has at least one normal form. In the fortunate 
situation where every term in TZ has a unique normal form, many investigations become 
somewhat simpler. In order to guarantee this property, we need further restrictions on TZ. 

Definition 2.6.6 (Confluent). A rewriting 2-theory 1Z is confluent if any diagram 

[h] ^- [s] M 

in F(TZ) can be completed into a (not necessarily commutative) square: 

[s] — M 
i 

PI I 72 

y 

[h]-->[u] 

Definition 2.6.7. A rewriting 2-theory is complete if it is terminating and conflu- 
ent. Otherwise it is incomplete. 

Lemma 2.6.8. A complete rewriting 2-theory has unique normal forms. 

Proof. Let 1Z be a complete rewriting 2-theory and let t be a term in F(1Z). Since 
1Z is terminating, t has at least one normal form. Suppose that Afi(t) and A/^i) are 
normal forms for t. If N\{t) / A^(*)> then since 1Z is confluent there must be a term 
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v and reductions Ni(t) — > w for i £ {1,2} in F(1Z), contradicting the normality of these 
terms. □ 

Our investigation of coherence for rewriting 2-theories splits into two cases, correspond- 
ing to whether the theories are assumed to be complete or not with the latter case being 
somewhat more delicate. 

In the following chapter, we establish a link between algebraic invariants and coher- 
ent categorifications of equational theories. This link is particularly useful for complete 
rewriting 2-theories. 



CHAPTER 3 



Structure monoids 

In Theorem 2.5.3, we saw that a coherent categorification of an equational variety 
is equivalent to the original variety in the sense that it has an equivalent Lawvere 2- 
theory. The main purpose of this chapter is to highlight how this phenomenon arises in 
combinatorial algebra within the realm of structure monoids. Later, in Chapter 5, we shall 
exploit this connection in order to construct new presentations of some famous algebraic 
objects. 

Structure monoids were introduced by Dehornoy [Deh93] as algebraic invariants of a 
certain class of equational varieties. Dehornoy subsequently showed that Higman's groups 
F and V arise as algebraic invariants of the varieties of semigroups and of commutative 
semigroups, respectively [Deh05]. In particular, he showed how to construct presentations 
of these groups using Mac Lane's pentagon and hexagon coherence axioms for coherently 
associative and commutative bifunctors. 

The relations in Dehornoy's presentations consist of two parts. First, there are the 
so-called geometric relations, which arise purely from the fact that a semigroup is, in the 
first instance, a magma. The second class of relations arise from the particular equational 
structure of the variety at hand. In the case of F, one additional class of relations are 
added corresponding to the Stasheff-Mac Lane pentagon [ML63] and in the case of V, the 
presentation further contains a class of relations corresponding to the Mac Lane hexagon, 
which encodes the essential interaction between associativity and commutativity. 

The goal of this chapter is to place Dehornoy's constructions in a more general context. 
More precisely, we consider coherent categorifications of equational varieties. Within this 
setting, Dehornoy's geometric relations correspond to the functoriality and naturality of 
the associated categorical structure with the remaining relations arising from the coherence 
axioms. 

We recall the definition of structure monoids in Section 3.1 and go on, in Section 3.2 to 
show that a coherent categorification of an equational variety gives rise to a presentation 
of the associated structure monoid. In certain favourable situations, the structure monoid 
can be turned into a group and we show that the construction of a presentation from a 
coherent categorification carries over to this setting. 

3.1. Structure monoids 

In this section, we recall Dehornoy's construction of an inverse monoid associated to a 
balanced equational theory [Deh93]. 
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We begin by briefly recalling and exapnding upon some definitions from the previous 
chapter. For a graded set of function symbols T and a set X, we denote by ¥^(X) 
the absolutely free term algebra generated by T on X. An equational theory is a tuple 
(F\ £p)v, where V is a set of variables, J 7 is a graded set of function symbols and £? is 
an equational theory on F^-(V). A map ip : V — ► Fjf(V) is called a substitution and it 
extends inductively to an endomorphism F^(V) — > Fjr(V). By abuse of notation, we label 
this latter map by p as well. We use [V, F^(V)] to denote the set of all substitutions. For 
a term s € Fjf(V) and a substitution ip € [V, Fjr(V)], we use to denote the image of s 
under (p. The support of a term s is the set of variables appearing in it. A pair of terms 
(s,t) is balanced if they have the same support and an equational theory is balanced if 
every defining equation is balanced. 

Definition 3.1.1. Given a balanced pair of terms (s,t) mF^(V), we use p S) t to denote 
the partial function Fjr(V) — > Fjr(V) mi/i graph 

{(s*,n t^G[V,F^(V)]}. 

For a balanced pair of terms (s,t), the partial function p s ^ is functional since the 
support of t is a subset of the support of s. The stronger restriction that the pair is 
balanced is required since we wish to utilise the inverse partial function p t>s as well. 

Given an equational theory £ := (J r \£p)v, we use [£j?] to denote the congruence 
generated by £?r on Fjr(V) and we use Fg(V) to denote the quotient ¥jr(V)/[£^}. Similarly, 
we use [s] to denote the congruence class of a term s in Ff(V). It is clear that [u] = [p s ,t(u)] 
for any balanced equation (s,t) £ £?r and any term u E dom(p Si t). However, the collection 
of all partial maps p s> t for (s,t) £ £? is not sufficient to generate [£?■], since equations 
apply to subterms as well. To this end, we introduce translated versions of the maps p Sjt , 
that apply to arbitrary subterms. 

A subterm s of a term t is naturally specified by the node where its root lies in the 
term tree of t, which in turn is completely specified by the unique path from the root of t 
to the root of s in the term tree. A path in a term tree may be specified by an alternating 
sequence of function symbols and numbers, where the numbers indicate an argument of a 
function symbol. More formally, we have the following situation. 

For a graded set T := \J n T n , we set 

^ : =U U {(F,l),-..,(F,n)}. 

The set of addresses associated to T is denoted by and is the free monoid generated 
by Ajr under concatenation, with the unit being the empty string A. For a term t G Fjf(V) 
and an address a € A*^, we use sub(t, a) to denote the subterm of t at the address a. Note 
that sub(i, a) only exists if the term tree of t contains the path a and that sub(t, A) = t. 

Example 3.1.2. Suppose that T := {F, G}, where F is a binary function symbol and 
G is a ternary function symbol. Suppose that V is a set of variables. Then, the term 



3.1. STRUCTURE MONOIDS 



26 



F 




w G 




x y z 
Figure 1. The term tree of F(w, G(x, y, z)) 

t := F(w,G(x,y, z)) is in F^-(V). The term tree oft is given in Figure 1. The term t has 
the following subterms: 

sub(t, (F, 1)) = w sub(t, (F, 2)) = G(x, y, z) 

sub(t, (F, 1) (G, 1) = x sub(t, (F, 1) (G, 2) = y 

sub(t,(F,l)(G,3)) = 2 

Definition 3.1.3 (Orthogonal). Given a graded set J- and addresses a,f3 £ we 
say that a and (3 are orthogonal and write a _L if neither a nor (3 is a prefix of the 
other. Given a term t, and addresses a and [3, the subterms sub(t, a) and sub(t, (3) are 
orthogonal if a _L f3. 

Our current addressing system is sufficient to describe translated copies of the basic 
operators. 

Definition 3.1.4. Given a graded set of function symbols T , a variable set V, a bal- 
anced pair of terms (s, t) £ Fjf(V) and an address a £ A*^, the a-translated copy of p s j is 
denoted p" t and is the partial map Fp(V) — ► Fjr(V) defined as follows: 

• A term u £ Fjr(V) is in i/te domain of p^ t if sub(u, a) is defined and is in the 
domain of p s j- 

• For u G dom(p° t ), the image p° t (tt) is defined by 

sub(p° t (u),a) = p Sit (sub(n,a)) 

and sub(/9" t (n), /?) = sub(n, /3) /or ewery address (3 orthogonal to a. 
Note that p^ t = p s j- 

We are finally in a position to introduce the structure monoid generated by an equa- 
tional theory. 

Definition 3.1.5 (Structure Monoid). Given a balanced equational theory £ := {F \ £-t)v, 
the structure monoid of T, denoted Struct (T), is the monoid of partial endomorphisms of 
Fjf(V) generated by the following maps under composition: 

{Pst I ( s >*) or (*> s ) G £-T an d a £ A^} 

The structure monoid of an equational theory is readily seen to completely capture the 
equational theory. 



3.2. STRUCTURE MONOIDS VIA COHERENCE THEOREMS 



27 



Lemma 3.1.6 (Dehornoy [Deh93]). Let £ := (J-\£p)v be a balanced equational theory 
and let t,t' G Fjr(V). Then t =s t' if and only if there is some p G Struct (5) such that 
p{t) = t'. □ 

Given an equational theory £ = (J-\ £p)\) and maps p Sl ,t 1 , Ps 2 ,t 2 € Struct (T), the com- 
position p si ,ti ' Ps2,t 2 may be empty. It is nonempty precisely when there exist substitutions 
tp, Y> G [V,Fjr(V)] such that tf = s^. In this case, we say that the pair (ii,S2) is unifiable 
and that (ip,ip) is a unifier of the pair. In the case where (£1,52) is not unifiable, the 
composition /9 S i,ti ■ Ps 2 ,t2 results in the empty operator, which we denote by e. Note that, 
for any operator p G Struct (T), we have p ■ e = e ■ p = e. The existence of the empty 
operator makes freely computing with inverses in Struct(T) impossible. 

Definition 3.1.7 (Composable). An equational theory {f r \£j^)v is composable if any 
pair of terms in \J( St t)e£^{ s ^} are unifiable. 

Recall that an inverse monoid M is one in which for each element x G M, there is an 
element y G M such that xyx = x and yxy = y. Dehornoy [Deh06] shows that Struct (T) 
always forms an inverse monoid and contains the empty operator precisely when £ is not 
composable. One way in which to transform Struct (G) into a group is by passing to the 
universal group of Struct(£), which we denote by Structc(£ ), by collapsing all idempotents 
to 1. In the case where £ is composable, the idempotent elements of Struct (T) are precisely 
those operators that act as the identity on their domain. A particular class of composable 
theories is provided by a certain class of linear theories. Recall that an equation s = t 
is linear if it is balanced and each variable appears precisely once in both s and t. An 
equational theory is linear if each of its defining equations is linear. 

Lemma 3.1.8 (Dehornoy [Deh06]). A linear equational theory containing precisely one 
function symbol is composable. □ 

It follows from the above lemma that each linear equational theory containing precisely 
one function symbol gives rise to a structure group. 

Example 3.1.9. The equational theories for semigroups, S, and for commutative semi- 
groups, C, are both linear. Since these theories involve a single binary operator, Lemma 
3.1.8 implies that they are composable. In this case we have that Struct^S) is Thompson's 
group F and Structc(C) is Thompson's group V [Deh05]. 

In the following section, we shall see how structure monoids and groups relate to 
coherent categorifications of equational varieties. 

3.2. Structure monoids via coherence theorems 

The main goal of this section is to show how coherent categorifications of equational 
theories give rise to presentations of structure monoids. We base our analysis at the 
level of theories, rather than of equational varieties. While this is seemingly at odds 
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with Dehornoy's result [Deh93] that structure monoids are independent of the particular 
equational presentation of a variety, differing presentations of the same variety lead to 
distinct categorifications and thence to distinct presentations of the structure monoid. 

Dehornoy's utilisation of the pentagon and hexagon coherence axioms in order to obtain 
presentations of Thompson's groups [Deh05] is indicative of a more general relationship 
between structure monoids and coherent categorifications of equational theories. The first 
step on the road to formalising this relationship is to construct a monoid presentation out 
of a categorification of an equational theory. In light of Lemma 2.6.3, a good candidate 
for the generators of the monoid is provided by the singular morphisms of the categori- 
fication. Since we shall be moving back and forth between the structure monoid and a 
categorification, TZ(£), of an equational theory £, there is some danger of confusion about 
whether a symbol "p" lies in Struct (5) or in 1Z(£). Thus, in this section, we adopt the 
convention that an element marked as "jo" lies in 1Z(£ ) and an unmarked element "p" 
lies in Struct (£). A second notational difficulty arises due to the differing way in which 
elements of Struct(f ) and morphisms in Sing(7£(£)) are represented. For this reason, we 
give a way of rewriting singular morphisms to more closely resemble elements of Struct (5). 
If 1Z(£ ) = (J 7 ; I(£f) I <2>; £i(£ T ))-> then a reduction p £ I{£r) with source s and target t is 
written as p s j- 

Definition 3.2.1 (Type/ Address). Let 1Z{£) be a categorification of the equational 
theory £. The type, T(p Sji ) of a singular morphism p s j G Sing(7?.(£ )) is defined inductively 
by: 

T{a u ,v) ifp s ,t = • • • , h$u,v, l,-- - ,1) 
p St t otherwise. 
The address, A(p) is the word of A*j? constructed as follows: 

i-1 



T(p 



•s.t) 



A{p) 



(F,i)A(a) tfp = F(l,...,l,a,l,...,l) 
A otherwise. 



Given a categorification 1Z(£ ) of an equational theory £ , we can now construct a monoid 
whose generators are the singular reductions of 1Z(£) and whose relations are generated 
by the functoriality, naturality and coherence axioms. 

Definition 3.2.2. Let £ := {J 7 \£jr)y be a balanced equational theory and let 1Z{£) := 
(F;L{£j:) \0;£j(£ T )} be a categorification of £. The monoid S(1Z(£)) is the monoid gen- 
erated by 

{T(p) A(?) | p G Sing(7£(£))} U {p° s | (a,t) or (t,s) in £ T and a G A* r } 
iflZ(£) is composable and by 

{T(p) A(?) | p G Smg(Tl(Sjr))} U {p% | (s,t) or (t,s) in £ T and a G A* F ) U {e} 
otherwise, subject to the following relations. 
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Identity: 



Ps,s Ps,t — Ps,t 
Ps,t Pt,t — Ps,t 



Composition: If t± and S2 are not unifiable then 

P Sl ,ti Ps 2 ,t 2 - £ 



Empty operator: 



Ps,f£ = $ 

?-Ps,t = £ 



Functoriality: For a _L (3: 



n a ■ rP = rfi ■ n a 

rs,t ru,v ru,v rs,t 

• Naturality: Suppose that p Sjt € I{£r) is a generator and that some variable x 
appears at addresses (3i, . . . ,(3 P in s and at addresses j±, . . . , 7 g in t. Then, for 
all addresses a, 5 and each p U;V € I{£r) : 

~a ~a~ n & -a79<5 _ -aft 5 -a/? p 5 „ a 

Ps,t Pu,v ■ ■ ■ Pu,v — Pu^ v ■ ■ ■ Pu,v P S: t 

• Coherence: For (<7i • . . . -a p , T\ ■ . . . -T q ) G £ I ^ r ^, where each o~i and Tj is singular, 
set: 

T((7i) A(<Tl) • . . . • T(a p ) A ^ = T(ti) a(ti) • . . . • T(r (? ) A(r ' ) 

The relations for functoriality and naturality in E>(1Z(£)) are adapted from [Deh06]. 
The functoriality relation is precisely the requirement that each operator F £ T is a 
functor. The naturality condition is, in turn, precisely the requirement that each p € I{£p) 
is a natural transformation. The rather involved addressing system in the naturality 
condition is due to the fact that the same variable may appear multiple times in different 
positions on either side of an equation. For naturality, one needs to apply a map to each 
of these instances of the variable simultaneously. We now set about relating §(1Z(£)) to 
Struct (f). 

Lemma 3.2.3. Let £ be a balanced equational theory and let 1Z{£) be a categorification 
of £. Then S>(R,(£)) is an inverse monoid. 

Proof. For nonempty p := p^ tl ■ . . .#* tfc , set /T 1 := p^ ■ . Since p s , t is 

the inverse of pt, s , it follows that 

p- (T 1 ■ p = p 
p-'-p-p' 1 = P' 1 

Since we also have that e-e-e = e, it follows that S>(1Z(£)) forms an inverse monoid. □ 
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We now know that both E>(1Z(£ )) and Struct(£) are inverse monoids. Since there is 
a clear relationship between the generators of each, in order to establish that they are in 
fact isomorphic we need to focus on the relations. In particular, since 1Z(£) is an arbitrary 
categorification of £, it might contain inequivalent reductions with the same source and 
target. Since the elements of Struct(£) are partial functions completely determined by their 
domain and codomain, such a situation cannot occur in Struct(£). These considerations 
lead one to suspect that if we require 1Z(£ ) to be a coherent categorification of £ , then the 
two monoids might in fact be isomorphic. 

Theorem 3.2.4. Let£ be a balanced equational theory and letTZ(£) be a categorification 
of £. The following map is an epimorphism of inverse monoids and it is an isomorphism 
if and only if 1Z(£) is coherent: 

$(K(£)) Struct(f) 

~CK1 ~a k Ct\ Ct k 

Psi,ti ' • • • ' Ps k ,t k 1 > Ps\,t\ ■ • • • ■ Ps k ,t k 

PROOF. By construction, is a homomorphism of inverse monoids. For surjectivity, 
we need only show that every generator p° t E Struct(£) corresponds to some singular 
morphism S{p^ t ) £ Sing(7£(£ )). This singular morphism can be constructed recursively 
as follows: 



S{p%t) 



^Ps,t if ct = A 

It remains to show that G is faithful if and only if 7Z(£) is coherent. 

Suppose that is faithful and let pi,p~2 be a parallel pair of morphisms in ¥(TZ(£)). 
Then ®(pi) = 6(^2), since p\ and p2 have the same source and target. Since is faithful, 
it follows that p\ = P2- 

Conversely, suppose that 1Z(£) is coherent and that 0(pi) = ®{p2)- Then, p\ and 
have the same source and target. Since 1Z{£) is coherent, it follows that p\ = p\. □ 

The above theorem is very closely linked with Theorem 2.5.3. The essential insight is 
that Struct(£) is simply a monoid encoding of F(£), while §(7Z(£)) is a monoid encoding 
of ¥(TZ(£)). In order to extend this correspondence to structure groups, we need to modify 
our presentations slightly. 

Definition 3.2.5. Let £ be a balanced composable equational theory and let TZ(£) be 
a categorification of £. The group * s generated by 

{T{p) A ^ IpeSing^))}, 

subject to the functoriality, naturality and coherence relations from Definition 3.2.2, to- 
gether with the following relation: 



(pltY 1 = P°t 



id 
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Following the same line of reasoning as in the proof of Theorem 3.2.4, we obtain the 
following relationship between §g(7?.(£)) and Structc(5). 

Theorem 3.2.6. Let £ be a balanced, compos able equational theory and let 7Z(£) be 
a categorification of £ . The following map is an epimorphism of groups and it is an 
isomorphism if and only if 1Z{£) is coherent: 

S G (TZ(£)) Struct G (£) 



rsi,ti ■ ■ ■ rs k ,t k Psi,ti • • • l J s k ,t k 



□ 



Example 3.2.7. As we saw in Example 3.1.9, the structure group for semigroups is 
Thompson's group F and the structure group for commutative semigroups is Thompson's 
group V , which is the first known finitely presented infinite simple group. It follows from 
Theorem 3.2.6 and Mac Lane's coherence theorem for monoidal categories [ML63] that 
we may construct a presentation for F using the pentagon coherence diagram displayed in 
Example 2.1.9. A categorification of the theory of commutative semigroups contains an 
invertible reduction rule r : a <8> b — ► b <S> a. It follows from Mac Lane's results [ML63] 
that a coherent categorification of the theory is provided by requiring r ■ r = 1, together 
with the pentagon axiom and the hexagon axiom, which states that the following diagram 
commutes: 

a <g> (6 <8> c) — *- (b <g) c) ® a — b <g> (c <g> a) 



(a <g> b) <8> c s- (b <g> a) (8) c >■ b <g) (a <8) c) 

This coherence theorem allows us, as a result of Theorem 3.2.6, to construct a presentation 
of Thompson's group V . These presentations for F and V are the same as those constructed 
by Dehronoy [Deh05]. 

Paraphrasing theorems 3.2.4 and 3.2.6, whenever we have a coherent categorification 
of a balanced equational theory, we automatically have a presentation of the associated 
structure monoid or group. As we shall see in Chapter 5, this is a reasonably powerful 
result, allowing us to obtain presentations of certain important infinite groups. However, 
before we can embark upon that investigation, we need a way of constructing coherent 
categorifications and proving that a given rewriting 2-theory is coherent. The following 
chapter solves these problems for rewriting 2-theories that are terminating and confluent, 
which is precisely the situation that arises in Chapter 5. 
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Coherence for complete theories 



In order to obtain a general coherence theorem for rewriting 2-theories, one needs to 
find distinguishing features of the underlying rewriting theory that make the investigation 
tractable. As a first port of call, one might examine Mac Lane's proof of coherence for 
monoidal categories [ML63] . Looking at this proof from the angle of rewriting theory, one 
notices several things. First, every reduction rule in the structure, presented in Example 
2.1.9, is invertible. Second, an analysis of the rewriting system consisting of only the 
positive maps a, A, p reveals that this subtheory is complete. Finally, one only needs to 
show that each term has a unique reduction to its unique normal form in order to rapidly 
conclude coherence. An approach along these lines is used by Johnson [Joh87] in order 
to develop a general coherence theorem for pasting diagrams in n-categories. 

Similar considerations led Merries to formulate the notion of "universal confluence" for 
a term rewriting theory within his framework of axiomatic rewriting theory [Mel02]. In 
order to formulate this concept within our setting, we require the notion of a commuting 
joining. 

Definition 4.0.8 (Span). A span, S, in a rewriting 2-theory 1Z is a diagram of the 
form 

<pi y>2 
s 9-^2 

in ¥(11). A joining of S is a term t of¥(lZ) together with reductions ipi : u\ — > t and 
ip2 '■ U2 — > t in ¥(11). Pictorially, a joining is: 




We call S joinable if a joining of S exists and we call S commuting- joinable if a joining 
exists such that the above diagram commutes. 

Universal confluence is intended to capture the strong version of confluence present 
within monoidal categories, which coincides with the presence of pushouts in the free 
monoidal category on a discrete category. More specifically, it may be described as follows: 

For every span u\-* s and for i £ {1,2}, there is a commuting 

joining ipi : m — > t such that for any other commuting joining n : U{ — > t, 
there is a unique map p : t — > v making the following diagram commute: 
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7 



s 



t 




For a general rewriting 2-theory, the map p in the above diagram does not necessarily 
exist. However, whenever every reduction rule is invertible and the positive subtheory has 
unique normal forms, as is the case for monoidal categories, we can construct p quite easily. 
Indeed, since there are maps s — > t and s — ► v, both t and v must have the same normal 
form Af(s). This means that there is a map Nt : t Af(s) and a map iV„ : v — ► AA(s) and 
we may simply take p to be • N^ 1 . 

When 1Z contains non-invertible rules, the existence of p is no longer guaranteed. 
Surprisingly though, the invertibility of the rules is not crucial for coherence. This was first 
demonstrated by Laplaza's coherence theorem for categories with a directed associativity 
map a : a (g> (b (g> c) — > (a ® 6) <8> c that is not necessarily invertible [Lap72a]. Remarkably, 
the only coherence axiom required for this result is Mac Lane's pentagon — precisely what 
is required in the invertible case. 

We are now in the situation of needing to discern conditions on a confluent and ter- 
minating rewriting 2-theory that ensure Mac Lane coherence. Our approach needs to be 
delicate enough to handle both the invertible and the non-invertible case, since the same 
coherence axioms usually suffice for both. Ultimately, we shall end up with a slightly 
weaker and more general concept than universal confluence, essentially not requiring the 
existence of the map p. 

Our approach requires some classical tools and lemmas from first order term rewriting 
theory and we briefly cover the required material in Section 4.1. In Section 4.2, we develop 
a practical general coherence theorem for complete rewriting 2-theories and extend this 
result to invertible theories in Section 4.3. 



The focus of this section is on several classical lemmas that make the examination 
of confluence for finitely presented rewriting theories tractable. This analysis essentially 
reduces to enumerating over the possible ways in which two reductions can diverge within 
the theory. In other words, what we seek is some sort of classification of all possible spans 
that can arise from the theory. In light of Lemma 2.6.3, we can begin by focussing our 
attention on singular reductions. 

(pi <p2 

Definition 4.1.1. A span u±< s ^112 is singular if both ipi and tp2 are singular. 



4.1. Classical lemmas 



In the case where the rewriting theory is terminating, Newman's Lemma reduces con- 
fluence to showing that every singular span is joinable. 
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Lemma 4.1.2 (Newman's Lemma [New42]). A terminating rewriting 2-theory is con- 
fluent if every singular span is joinable. □ 

A singular span u\-^- — s — > ui may take one of three forms: 

• ipi and if2 rewrite disjoint subterms of s. 

• ipi and (f2 rewrite nested subterms of s. 

• <pi and if2 rewrite overlaping subterms of s. 

In practice, it is the rewriting of overlapping subterms of s that can lead to non- 
confluence. It is, therefore, important to define precisely what we mean when we say that 
two reductions overlap. Before we do this, we need to identify all possible places where a 
reduction rule could apply. 

Definition 4.1.3 (Redex). Let Jz? := (T;T\£f) x be a labelled rewriting theory and 
let p : [s] — > [t] be a reduction rule in T. For a substitution a : X — > Fj^(X) and a term 
u £ [s], the term u a is called a p-redex. 

We are now in a position to define overlapping reduction rules. 

Definition 4.1.4 (Overlap). Let Jz? := (J 7 ; T \ £p)x be a labelled term rewriting theory 
and let t £ Ff(X). Two subterms t\,t2 of t overlap if they share at least one function 
symbol occurence. Two reduction rules pi : [si] — > [ti] and p2 : [S2] — ► [^2] T overlap if 
there is a term t containing instances of a p\-redex r\ and a p2-redex r2 such that r\ and 
r2 overlap. We do not count the trivial overlap between a redex r and itself unless r is a 
redex of two different reduction rules. 

Example 4.1.5. In the positive subtheory of the theory for monoidal categories given 
in Example 2.1.9, the reduction rules p and A overlap on the term a <8) (I ® b) and the 
reduction rule a overlaps nontrivially with itself on the term a <S> (b <S> (c ® d)). 

With Newman's Lemma in mind, we now restrict our focus to singular reductions that 
rewrite overlapping terms. Unfortunately there may be infinitely many such spans, even 
for finitely presented theories. However, if we know that a certain span is joinable, then 
we automatically know that all substitution instances of it are joinable. Therefore, we can 
refocus our investigation on finding a minimal set of overlapping spans S such that any 
overlapping span is a substitution-instance of a member of S. 

Definition 4.1.6. Let T be a graded set of function symbols. Given terms t,u G 
Fjr(X), we say that u is an instance oft if there is a substitution a such that u = t a . A 
term v € W^(X) is a common instance of the terms t,u £ Fjr(X) if it is an instance of 
both t and u. The term v is the most general common instance of t and u if any other 
common instance oft and u is also an instance of v. 

Two terms may not have a common instance but when they do, they are guaranteed 
to have a most general common instance. The reader may find a proof of the following 
lemma in [DJ90]. 
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Lemma 4.1.7. Let J- be a graded set of function symbols. Ift,u £ Fjf(X) have at least 
one common instance, then they have a most general common instance. □ 

Given two overlapping reductions, we can bootstrap the notion of most general common 
instance in order to obtain the "most general" way in which the two reduction rules can 
overlap. Before we do this, however, we need to know precisely how two reduction rules 
overlap. This information is provided by the following lemma, a proof of which may be 
found in [KdV03, Lemma 2.7.7]. 

Lemma 4.1.8. Two reduction rules p\ : Si — > t\ and p 2 '■ S2 —> ti overlap if and only if 
there is a non-variable subterm of si that can be matched with a p2-redex or a non-variable 
subterm of S2 that can be matched with a pi-redex. □ 

In order to facilitate our definition of the "most general" overlap of two reduction 
rules, we need a way of specifying a distinguished subterm of a term. To this end, we 
use the notation t{s} to denote a term t with a distinguished subterm s. We may apply 
rewrites directly to the subterm s. If p : s — > s' is some reduction, then we may apply 
t{p} : t{s} - t{s>}. 

Definition 4.1.9 (Critical span). Consider a pair of overlapping reduction rules p\ : 
[£i] — > [ri] and pi : [£2] — > \r2[. By Lemma 4-1-8, we may assume that £\ = t{u} and 
that there are substitutions a,T such that u a = £\. By Lemma 4-1-7, we may assume that 
u a = £\ is a most general common instance of u and £2- Then, the following span arising 
from this overlap is called a critical span: 

Example 4.1.10. Consider the positive theory for monoidal categories given in Exam- 
ple 2.1.9. We then have the following reduction rules: 

a{ti,t 2 ,t 3 ) : h (8) (t 2 (8) t 3 ) (h <g> t 2 ) ® t 3 
A(t) : I®t^t 
p(t) : t®I -> i 

By Lemma 4-1-8, in order to find all overlaps between the reduction rules, we need only 
insert redexes of reduction rules as subterms of redexes of other reduction rules. 

The reduction rule a contains two instances of®. Thus, it overlaps nontrivially with 
itself and leads to the following critical span: 

a®(b® (c®d)) 
(a <g> b) (g> (c (8> d) a <g> ((6 ® c) ® d) 
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Furthermore, a overlaps with A and p in three possible ways, leading to the following 
critical spans: 



(4.2) (I ® b) ® c^- / ® (6 <g> c) - 



(4.3) (a <8> /) ® c^— a <g> (J <g> c/^-^a <g> 6 

(4.4) (a (8) b) <g> a®{b®I) <g> 6 

Finally, A anc? p overlap with each other, leading to the following critical span: 



(4.5) It — — I ® I — — *-I 

This exhausts all of the critical spans arising in the theory. 

As mentioned previously, the main utility of critical spans is that they drastically 
reduce the number of spans we need to check for joinability when investigating confluence. 
This result is embodied in the critical pairs lemma, so named because critical spans are 
usually identified with their pair of reduced terms. The reader may find a proof of the 
Lemma in [KdV03, Lemma 2.7.15]. 

Lemma 4.1.11 (Critical Pairs Lemma). Let Jz? be a labelled rewriting theory. Every 
singular span in Jz? is joinable if and only if every critical span in Jzf is joinable. □ 

In the following section, we develop a general coherence theorem for rewriting 2-theories 
having unique normal forms. Our basic strategy is to obtain versions of Newman's Lemma 
and the Critical Pairs Lemma that take into account the commutativity of the diagrams 
involved. This leads to some additional subtleties, but the basic strategy remains close to 
this section. 

4.2. Coherence for directed theories 

In this section, we develop a coherence theorem for terminating and confluent rewriting 
2-theories. For this, we shall need to refine our notion of confluence. 

Definition 4.2.1. Let TZ be a rewriting 2-theory. A span S in F(1Z) is commuting- 
joinable if there is a joining of S that commutes in F(1Z). We say that TZ is commuting- 
confluent if every span in F(1Z) is commuting- joinable and we say that TZ is locally 
commuting-confluent if every singular span in ¥{TZ) is commuting-joinable. 

We are now in a position to obtain a strong form of Newman's Lemma that includes 
information on the commutativity of diagrams. 



Lemma 4.2.2 (Strong Newman's Lemma). Let TZ be a terminating, locally commuting- 
confluent finitely presented rewriting 2-theory. Then: 
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(1) Every term s € F(7Z) has a unique normal form Af(s). 

(2) Any two reductions from s to ftf(s) in F(TZ) are equal. 

Proof. Part (1) follows from the classical Newman's Lemma, since TZ is terminating 
and confluent. 

For Part (2), suppose that s is a term in F(7Z) and let p and ip be two reductions 
from s to Af(s) in F(TZ). Then, it follows from Lemma 2.6.3 that p = p\ ■ p2, where p\ 
is singular. Similarly, ip = ipi ■ ip2, where ip\ is singular. Suppose that p\ : s — > u\ and 
ipi : s — > U2- Then, these two arrows form a singular span, which by assumption has a 
commuting joining n : Ui — > t, where t is a term in F(TZ) and i G {1,2}. Since there 
is a reduction s — > t in F(7£), it follows from Part (1) that Af(t) = Af(s) and there is a 
reduction p : t — ► ftf(s). For a term a, let //(a) be the length of the longest reduction from 
a to A/"(a) in F(7£) that does not contain an identity reduction. This is well defined since 
TZ is terminating and finitely presented. We proceed by induction on /i(s) to show that 
<p = ip by showing that the following diagram commutes in F(JZ): 



If /Lt(s) = 0, then s = AT(s) and tp = ijj = l s . Suppose that /j,(s) > 0. Without 
loss of generality, we may assume that neither <p\ nor ipi is l s . Then, since there is a 
reduction from s to u\ and one from s to U2, it follows from Part (1) that Af(ui) = 
M{u2) = M{s). Hence, fJ,(u±) < fi(s) and ^(^2) < an( i ^ follows from induction that 
the subdiagrams labelled (2) and (3) in the diagram above commute. Since the diagram 
labelled (1) commutes by assumption, we have that <p = ip. □ 

By the preceding lemma, we know that each term in a terminating, locally commuting- 
confluent and finitely presented rewriting 2-theory has a unique reduction to a unique 
normal form. In order to pass from this fact to a general coherence theorem, we need a 
way of extending this result to arbitrary parallel pairs of reductions. The property that 
turns out to be most useful for achieving this is for every reduction to be monic. 

Definition 4.2.3. A rewriting 2-theory (J 7 ; T \ Ejr • Et) is monic if whenever (pi ■ 
ip,tp2 • ip) € £t niodulo the basic congruence, we have (pi,p2) £ &r- 

Recall that an arrow g : b — > c in a category ( rf is called "monic" if for every pair of 
arrows /1, fi : a — ► b in if f\ ■ g = fi ■ g then f\ = f2- The following lemma follows 
immediately from the construction of F(JZ) for a rewriting 2-theory TZ. 
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Lemma 4.2.4. IflZisa monic rewriting 2-theory, then every arrow in F(1Z) is monic. 

□ 

We now have all the necessary ingredients for a general coherence theorem. 

Theorem 4.2.5 (Coherence). A finitely presented rewriting 2-theory is Mac Lane co- 
herent if it is monic, terminating and locally commuting- confluent. 

PROOF. Let 1Z be a rewriting 2-theory satisfying the hypotheses. Suppose that t\ and 
t 2 are two reductions s — > t in general position in F(1Z). By Lemma 4.2.2, there is a unique 
arrow p : t — > J\f{t) and Af(s) = J\f(t). So, T\ ■ p and t 2 ■ p are two arrows s — ► A/"(s) in 
F(7£). Lemma 4.2.2 implies that <p\ ■ p = tp 2 ■ P- Since 1Z is monic, it follows from Lemma 
4.2.4 that ifi = ip 2 . □ 

Theorem 4.2.5 effectively reduces the problem of showing that a rewriting 2-theory is 
coherent to showing that the underlying term rewriting system is terminating and conflu- 
ent. In order to make effective use of this coherence theorem, we establish a strong form 
of the Critical Pairs Lemma. 

Lemma 4.2.6 (Strong Critical Pairs Lemma). A rewriting 2-theory is locally commuting- 
confluent if and only if every critical span is commuting-joinable. 

PROOF. By definition, every critical span in a locally commuting-confluent rewriting 
2-theory is commuting-joinable. For the converse direction, let 7Z be a rewriting 2-theory 

in which every critical span is commuting-joinable. Let S := s be a singular 

span in F(1Z). We can distinguish three possibilities for this span: 

(1) ip and ip rewrite disjoint subterms of s. Without loss generality, we may assume 
that s = F(ti,t2), that if = F((p', L 2 ) and that ip = F(l tl , tp'); where (p' : t± — > t[ 
and ip' : t 2 — > tf 2 . Then, we have 

F( V ',^):F(t 1 ,t 2 )^F(t' 1 ,t' 2 ). 

By the functor iality of F, we have 

iV,i).F(i,vO = *V • i, i ■ v') 

= F(lV^'-l) 
= F(l,^')-F(<p',l) 

So, S is commuting-joinable to F(ti,t 2 ). 

(2) (p and i/j rewrite nested subterms of s. Without loss of generality, we may assume 
that s = p{q}, that ip = (p'{l q } and that ip = l p {if)'}; where ip' : p — > p' and 
tp ■ q — > q' ■ Then we have 

ip'{tp'} : p{q} -> p'{q'}. 
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If <p' is an instance of a left-linear reduction rule, then using (Nat 1) and (Nat 2) 
we get that S is commuting-joinable to p'{q'} via the following chain of equalities: 

A similar argument works when (p' is an instance of a non left-linear rule, using 
step (1) to rewrite the residuals of q in parallel. 
(3) (p and ip rewrite overlapping subterms of s. Without loss of generality, we may 
assume that <p : s — > si and </? : s — > S2- By the definition of a critical span, S 1 
is then a substitution instance of a critical span, which is commuting-joinable by 
assumption. 

By the constuction of F(1Z), it follows that S is commuting-joinable. □ 

Example 4.2.7. In this example we utilise Theorem 4-2-5 to obtain a straightforward 
proof of the coherence theorem for categories with a directed associativity map. This co- 
herence theorem is the main result of [Lap72a] . 

Let 7Z be the rewriting 2-theory consisting of a single binary function symbol <8>, the 
reduction rule 

a(x, y, z) : x <g> (y <g> z) — > (x <g> y) ® z 

and the left-hand diagram from Example 2.1.9 as an equation on reductions. Then, 1Z is 
terminating by induction with the ranking function 



Pit)- 

The only critical span in this system arises as 



p(a) + 2p(b)-l ift = a®b 
1 otherwise. 



a <g> (6 (c®d)) 

(a (8) 6) <g> (c (8> d) a®((b®c)®d) 

By the equation we placed on reductions, this critical pair is commuting-joinable to ((a (g> 
6) <8> c) <8> d. By Lemma 4-2.6, 1Z is locally commuting- confluent. By Lemma 4-2. 4, 7Z is 
also monic. So, we may apply Theorem 4-2.5 and conclude that 1Z is coherent. 

Considering the above results, one may be tempted to massage an incomplete rewriting 
2-theory into a complete one and thus apply the coherence theorems. Indeed, the famous 
Knuth-Bendix completion algorithm [KB 70] achieves precisely this. Unfortunately, such 
a procedure typically adds additional reduction rules to the rewriting theory and this is 
certainly the case with the Knuth-Bendix algorithm. Rather than simplifying the coherence 
problem, this additional structure results in a new rewriting 2-theory with a completely 
independent coherence problem whose solution sheds very little light on the coherence 
problem for the original theory. 
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In the following section, we tackle the problem of coherence for invertible rewriting 
2-theories. 

4.3. Coherence for invertible theories 

It is not immediately obvious whether Theorem 4.2.5 can be extended in any mean- 
ingful way to invertible rewriting 2-theories. The reason for this is that such systems are 
necessarily non-terminating. We can, however, sidestep this problem by restricting our 
attention to an orientation of a rewriting 2-theory. 

Definition 4.3.1 (Orientation). Let 1Z := (T;T\Sjr; £ r ) be a rewriting 2-theory. An 
orientation ofTZ is a function O : T — > {1,-1} such that: 

(1) 0{a) = 1 for any non-invertible rule a. 

(2) For an invertible pair of rules (a, (3), either: 

• 0(a) = 1 and 0(J3) = -1, or 

• 0(a) = -1 and 0(J3) = 1. 

Given an orientation on a rewriting 2-theory 1Z, we can restrict our attention to a 
directed subtheory of 1Z. 

Definition 4.3.2. Given a rewriting 2-theory 7Z := (T; T \ £p ; £q-) with orientation 
O, a reduction rule a G T is positive if 0(a) = 1 and negative otherwise. The positive 
subtheory of 1Z relative to O arises from 1Z by discarding all negative reduction rules from 
T and discarding all equations from £q- that contain an instance of a negative reduction 
rule. 

Working relative to an orientation, we can now extend Theorem 4.2.5 to invertible 
theories. 

Theorem 4.3.3 (Coherence). A finitely presented invertible rewriting 2-theory is Mac 
Lane coherent if it has an orientation whose positive subtheory is terminating and locally 
commuting- confluent. 

Proof. Let 1Z be an oriented rewriting 2-theory satisfying the hypotheses and let 1Z + 
be its positive subtheory. For a reduction ip G F(7£), we write if) -1 for its inverse. Suppose 
that (p : A — > B is a reduction in W(1Z). By Lemma 2.6.3, 

, <P1 <P2 <Pn-l <fin D 

where each ipi is singular. Say that ipi is positive if it contains an instance of a positive 
reduction rule and negative otherwise. Since TZ + is terminating and locally commuting- 
confluent, Lemma 4.2.2 implies that each term t G F(7Z) has a unique positive map Nt : 
t — > Af(t) to a unique normal form M(t). We claim that each rectangle in the following 
diagram commutes: 
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A 



si 



<P2 



S2 



<P3 



fn-1 



Sn-1 



B 



N A 



N s 



N B 



M{A) 



M{ Sl ) 



M{s 2 ) 



n-l 



If (fi is positive, then it follows immediately from Lemma 4.2.2 that tpi ■ N Si = N Si _ l . 
If Lfi is negative, then Lemma 4.2.2 implies that (p^ 1 ■ N s ._ 1 = N s ., which implies that 
Pi ■ N Si = N Si _ 1 . Since each rectangle commutes, we have ip ■ Nb = Na, which implies that 
ip = Na ■ Ng 1 . Since Na and Nb are unique and we did not rely on a particular choice of 
ip, we conclude that 1Z is coherent. □ 

Example 4.3.4. In this example, we sketch a proof of Mac Lane coherence for monoidal 
categories. From Example J^.1.10, we know a set of critical spans for a certain positive 
subtheory of monoidal categories. This subtheory is terminating, as is readily verified by 
the following ranking function: 



P(t) 



p{a) + 2p(b) 
1 



1 



ift = a<£>b 
otherwise. 



In order to conclude coherence, we need only show that every critical span is commuting- 
confluent. From the definition of monoidal categories, we know that critical spans (4-1) and 
(4-3) are commuting-joinable. In his original definition of monoidal categories [ML63], 
Mac Lane included additional axioms providing commuting joinings for the remaining crit- 
ical spans. However, Kelly later showed [Kel64] that these critical spans are commuting- 
joinable as a consequence of the pentagon and triangle axioms for monoidal categories. 
It follows from Lemma 4-%-6 that the positive subtheory for monoidal categories is locally 
commuting- confluent. We may then apply Theorem 4-3.3 and conclude that the theory for 
monoidal categories is Mac Lane coherent. 

The basic approach to Mac Lane coherence outlined in this chapter of proving termi- 
nation and then analysing the critical spans can be successfully used to obtain coherence 
theorems for various other structures arising in the literature, such as distributive cat- 
egories [Lap72b] and weakly distributive categories [CS97]. In light of the results of 
Chapter 3, this approach may potentially be used to construct presentations of structure 
monoids and groups. The following chapter details a successful application of this strategy 
to constructing presentations of the Higman-Thompson groups. 



CHAPTER 5 



Catalan categories 

Thompson's groups F and V [Tho80] are important objects arising within combinato- 
rial group theory. The group F was originally introduced by Thompson in his investigation 
of word problems in finitely generated simple groups. This group was later rediscov- 
ered by homotopy theorists as the automorphism group of a free homotopy-idempotent 
[Dyd77b, Dyd77a, FH93]. The group F has several interesting properties. For instance, 
it is finitely presentable, has a simple commutator subgroup, has only abelian quotients, 
does not contain a nonabelian free group, is totally orderable and has exponential growth 
[CFP96]. 

In unpublished notes, Thompson showed that the group V is a finitely presented infinite 
simple group — the first known group of this type. McKenzie and Thompson [MT73] 
later described F as a group generated by the variety of semigroups. As we saw in Chapter 
3, the relation of F with associativity is again reflected by the fact that it is the structure 
group for the variety of semigroups. We also saw that the group V is the structure group 
for commutative semigroups and sketched how Dehornoy's presentations for these groups 
[Deh05] arise from the coherence theorems for coherently associative and commutative 
bifunctors. 

As shown by Higman [Hig74] , V is in fact a member of the infinite family of groups 
G n y, where n > 1 and r > are integers. In particular, V = G^i- These groups share 
many of the properties of V . For instance, they are infinite, finitely presentable and are 
either simple or have a simple subgroup of index 2. Brown [Bro87] subsequently showed 
that Thompson's group F fits into a similar infinite family F n)T , where F = i*2,i- 

In Section 5.1, we recall Brown's definitions of F n ^ and G n ,i. These groups are defined 
in a very similar way to F and V, which may lead one to wonder whether they too are 
structure groups of certain equational varieties. 

In Section 5.2 we introduce n-catalan algebras, which encode a notion of associativity 
for an n-ary function symbol and prove that F nt i is the structure group of the variety 
of n-catalan algebras, directly generalising the relation between F and associativity. We 
follow this with a definition of symmetric n-catalan algebras, which encode a notion of 
associativity and commutativity for an ro-ary function symbol and we show that G Ht i is 
the structure group of the variety of symmetric n-catalan algebras. 

We know from Chapter 3 that a coherent categorification of a balanced composable 
equational variety yields a presentation for the associated structure group. In Section 5.3, 
we construct a coherent categorification of the variety of n-catalan algebras, thus obtaining 
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a presentation for i^i. Finally, in Section 5.4, we construct a coherent categorification 
of the variety of symmetric n-catalan categories, thus obtaining a presentation for G n ^\. 
These presentations are closely linked to Dehornoy's presentations for F and V, which we 
sketched in Example 3.2.7. 

The rewriting 2-theories that we construct in sections 5.3 and 5.4 are also interesting 
from a purely categorical point of view as they directly generalise the Mac Lane pentagon 
and hexagon coherence axioms for commutative and associative bifunctors. For functors 
of arity greater than 2, new coherence phenomena appear, which are not present in the 
classical binary case. 

5.1. The groups F n ^ and G n i 

In Chapter 3, we saw that Thompson's groups F and V arise as structure groups of 
certain balanced equational theories and we subsequently obtained presentations for these 
groups via coherent presentations of their associated categorical theories. In this section, 
we introduce generalisations of these groups due to Brown [Bro87] and Higman [Hig74], 
which we call F n i and G n ±, respectively. In the following sections, we shall see how 
the aforementioned process of constructing presentations for F and V generalises to this 
broader class of groups. 

There are several paths to defining the groups F„ i and G ra ,i, all of which relate to the 
fact that each of these groups arises as a subgroup of the automorphism group of a Cantor 
set. Of the myriad of definitions available, we choose to follow the description of Brown 
[Bro87], which utilises certain equivalence classes of pairs of finite rooted trees. 

Definition 5.1.1 (Tree). The set of n-ary trees is defined inductively as follows: 

• The graph consisting solely of a single vertex is an n-ary tree. 

• If Ti, . . . ,T n are n-ary trees then the following is also an n-ary tree: 




Ti T 2 ■■■ T n 

The root of an n-ary tree is the unique vertex of valence or n — 1. The leaves of a 
rooted tree T are the vertices of valence or 1 and we denote the set of leaves by £(T). 

Definition 5.1.2 (Expansion). A simple expansion of an n-ary tree T is the tree 
obtained by replacing a leaf v of T with the following: 

v 




ai(v) a 2 {v) ■ • • a n (v) 

In the above diagram, each cti is simply a label for the relevant leaf. An expansion of an 
n-ary tree is a tree obtained by making finitely many succesive simple expansions. 
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Given two trees Ti and T2 having a common expansion S, we say that S is a minimal 
common expansion if any other expansion S' of T\ and T2 is an expansion of S. 

Lemma 5.1.3 (Higman [Hig74]). Any two finite n-ary trees have a minimal common 
expansion. □ 

The underlying sets of the groups F nj i and G n ^\ consist of certain formal expressions 
called tree diagrams. 

Definition 5.1.4 (Tree diagram). An n-ary tree diagram is a triple (Ti,T2,<r), where 
T\ and T2 are n-ary trees having the same number of leaves and a is a bijection £{T\) — > 

As in the case of trees, we may talk about expansions of tree diagrams. 

Definition 5.1.5. A simple expansion of an n-ary tree diagram (Ti, T2, a) is an n-ary 
tree diagram (T^T^o 7 ) obtained by the following procedure: 

• T[ is the simple expansion of Ti along the leaf I . 

• T2 is the simple expansion of T 2 along the leaf a (I). 

• a' is the bijection £(T{) — > £(T£) defined by setting a'(k) = a(k) for k G £(Ti)\{l} 
and a'{oti{l)) = oti{a(l)). 

An expansion of an n-ary tree diagram (T±,T2,a) is any n-ary tree diagram obtained by 
making finitely many succesive simple expansions of (T±,T 2 ,a). 

Let ~ be the equivalence relation on the set of n-ary tree diagrams obtained by setting 
(Ti,T2,cr) ~ (T{, T2, ct') whenever (Ti,T2,cr) and (T[,T<2,(t') possess a common expansion. 
Let [(Ti, T2, a)] denote the equivalence class of (Ti,T2,<r) modulo ~. We call [(Ti, T2, a)] 
an n-ary tree symbol. 

Definition 5.1.6. For n > 2, we set G U) \ to be the group whose underlying set is the 
collection of n-ary tree symbols, together with the following group structure: 

• Given two n-ary tree symbols [(Ti,T, a)] and [(T' ,T 2 ,a')], it follows from 
Lemma 5.1.3 that we may assume that T = T' . We define their product to be 

[(T 1 ,T, ( 7)][(T,T 2 ,(7 / )] = m^a-a')]. 

• The inverse of [(Ti, T 2 , a)] is [(T 2 , Ti, a' 1 )} . 

• The unit element is [(T,T,id)]. 

It follows from the definitions that any n-ary tree is an expansion of the tree consisting 
solely of a single vertex. Thus, the leaves of an n-ary tree may be seen as a subset of the 
free monoid on {1, . . . , n}. Therefore, we may order the leaves of the tree lexicographically, 
which is equivalent to ordering the leaves left-to-right when drawn on a page. We say that 
an n-ary tree symbol [(Ti, T2, <r)] is order-preserving if a is an isomorphism of ordered sets; 
that is, if a preserves this ordering. 



5.2. F n ,i AND G„,i AS STRUCTURE GROUPS 



45 



Definition 5.1.7. For n > 2, we set F Uj i to be the subgroup of G n> i consisting of the 
order-preserving n-ary tree symbols. 

The groups F n ^ and G n ,i generalise Thompson's original groups i 7 and V, since we 
have F21 — F and G^i — V. They also share several of the interesting properties of 
F and V as surveyed in [Sco92]. In the following section, we shall realise F n> i as the 
structure group of higher-order associativity and G n> i as the structure group of higher 
order associativity and commutativity. 

5.2. F n: i and G n 1 as structure groups 

Our goal in this section is to realise F n ^ and G n ^\ as structure groups. Since both of 
these groups are built using maps between n-ary trees, we take our set of function symbols 
to be T := {®}, where <8> is an n-ary function symbol. For a set of variables V, there is an 
obvious bijection between F^(V) and the set of n-ary trees whose leaves are labelled by 
members of V. We denote the absolutely free term algebra generated by {<g>} on the set V 
by F®(V) and we denote the free monoid generated by V under concatenation by V*. 

Our basic strategy is to first realise F n> i as a structure group by constructing an 
equational theory £ such that [£] equates any two terms ii,i2 £ 1FV(V) that contain 
precisely the same variables in the same order and such that no variable appears more 
than once in either t\ or ti- In the binary case, this is achieved by imposing associativity. 
So, £ ought to be an analogue of associativity for n > 2. Once we have this realisation 
of F n< i we need only add the ability to arbitrarily permute variables in order to obtain a 
realisation of G Ut \ as a structure group. 

5.2.1. Catalan Algebras and F n ^. Associativity of a binary function symbol is 
sufficient to establish that any two bracketings of the same string are equal. The way in 
which one establishes this fact is to show that any bracketing of a string is equal to the 
left-most bracketing. So, for an n-ary function symbol to be associative, we need equations 
which imply that any bracketing of a term is equivalent to the left-most one. In order to 
simplify notation, for integers i < j, we use the symbol x\ to denote the list Xj, Xj+i, . . . , Xj. 
If * > j, then x\ is the empty list. 

Definition 5.2.1 (n-Catalan algebras). For n > 2, the theory of n- Catalan algebras 
consists of an n-ary function symbol (g> together with the following equations, where < 
i < n: 

We denote the theory of n- Catalan algebras by C n . 

The reason for the name of n-catalan algebras is that the set of all terms having k 
occurrences of the symbol <S> and containing precisely one variable is in bijective corre- 
spondence with the set of n-ary trees having k internal nodes, which has cardinality equal 
to the generalised Catalan number / n _A fc+1 ( n fc fc ), [Sta99]. The rather opaque equational 
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theory of n-Catalan algebras is rendered somewhat more understandable by viewing the 
induced equations on the term trees, which for n = 3, yields the following: 




X\ X2 X\ ■ X5 ■ X4 X5 




x 3 x 4 x 5 x 2 x 3 x 4 xi x 2 x 3 



In order to apply the strategy from the binary case to the n > 2 case, we need to define 
what we mean by the left-most bracketing of a term t. Intuitively, this is the term having 
the same variables as t in the same order, with all instances of <g> appearing at the left. 

Definition 5.2.2 (Underlying list). Let t G Fg>(V). The underlying list oft is the 
word of V* defined inductively by 

j-t /,\ fu(h)-...-u(t n ) ift = ®{h,...,t n ) 

U{t) = I 

t otherwise 

Given the underlying list of a term, we can define the left-most bracketing by recursively 
adding all instances of (g>. 

Definition 5.2.3 (Left-most bracketing). Let t G F (V). If U(t) =ti ■ ... ■ t n +fc(n-i); 
then the left-most bracketing oft is defined recursively by 

lmb(^ +fc(n_1) ) = lmb(<8>(t?),tSJ (n_1) )- 

Example 5.2.4. In the table below, the right-hand term is the left-most bracketing of 
the left-hand term. 





t 


lmb(t) 


n = 2 




®(a,®(®(b,c),d)) 


0(®(0(a,6),c),d) 


n = 3 




(g>(a,(g)(b, c, d),(g>(e,f, g)) 


(8>(®(<8)(a,6,c),d, e),/, 5) 


n = 4 




<g>(a, b, c, ®(<Z, e, g, h, i),j)) 


®(®(<8)(a, 6, c, d), e, /, 5), h, i, j) 



We wish to establish that any term F®(V) is equal, in Fc n (V), to its left-most brack- 
eting. To this end, we define a rewriting theory based on C n . 

Definition 5.2.5. C^* is the labelled rewriting theory consisting of an n-ary function 
symbol ®, together with the following reductions, where < i < n: 

The reduction rules of always move a term "closer" to its left-most bracketing. 
This observation is formalised in the following lemma. 

Proposition 5.2.6. is terminating and confluent. Given a term t G F®(V), its 
unique normal form in ¥c^(V) is given by lmb(t). 
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Proof. We construct a ranking function on F^(V), which establishes that Fc^*(V) is 
terminating, that for every term t £ Fg,(V) there is a reduction i — > lmb(i) and that lmb(i) 
is a normal form for t. We begin by defining the length of t. 



L(t) 



1 otherwise. 



Define the rank, of t inductively by setting = if t € V and 

n n , _ s 

«(«(*?)) = E + E(* - X ) L ^) - ^V^- 

i=l i=2 

We proceed by double induction on R(t) and L(t). If L(i) = 1 then the statement is trivial. 
If t = ®(t?) and t = lmb(t), then R(t) = R(h) + £™ =2 (* - 1) - = fl(ti) and it 

follows inductively that R(t) = 0. Conversely, if R(t) = 0, then t = lmb(t), since otherwise 
we would have L{ti) > for some 2 < i < n, from which it would follow that R(t) > 0. 

Suppose that Lit) > 1 and R(t) > 0, so that t = <S>(i"). Let i be the greatest integer 
with the property that ij ^ V. If i = 1, then i = lmb(i) by induction on L(t). If z > 1, 
then U = <S>(ki) and cti : t —> t' , where 

t' = ®(ti^®(ti-iX _1 )> 

We then have: 

R(t)-R(t') = 12(t i _i)+i2(®(«y)) + (i-2)L(t i _i) + (i-l)L((8)(uy)) 

-^(ti-i,^- 1 )) - ^K) - (» - 2)L(®(* i _i,«r 1 )) 

-(« - l)L(n n ) 

n n—1 n 

= Eo' - 1 ) L (%) + E l w - Eo' - ^K-i) 

j=2 j=l i=2 

= (n-l)L(n n ). 

Since L(u n ) > 1, we have R{t') < R(t) and the proposition follows by induction on 
R(t). □ 

Since each reduction rule in is a directed version of an equation in C n , we imme- 
diately have the following corollary. 

Corollary 5.2.7. For any t € F®(V), we have t = Cn lmb(i). □ 

In order to manipulate elements of Struct (C n ) effectively, we introduce the notion of 
a seed. 

Definition 5.2.8 (Seed). Let T he a graded set of function symbols on some set V and 
let p be a partial function Fjf(V) — > Fjr(V). ^4 seed /or p is a pair of terms s,t € Fjr(V) 
suc/i t/iat i/ie graph of p is equal to {(s^,^) | if £ [V, Fjr(V)]}. 
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In particularly nice cases, we can construct seeds for any operator in a structure 
monoid. 

Lemma 5.2.9 (Dehornoy [DehOO]). Let T be a balanced equational theory that contains 
precisely one function symbol. Then, each operator p £ Struct (T) admits a seed. 

It follows from Lemma 5.1.3 that C n is composable and we may, therefore, form the 
group StructG(C ra ). In order to facilitate the passage from members of Structc(C n ), to 
members of F n> i, we introduce the tree generated by a term. 

Definition 5.2.10. For a term t € F®(V), let T(t) denote the n-ary tree obtained via 
the following construction: 

• If t = . . . , t n ), then T(t) is equal to: 




T(h) T(t 2 ) ••• T{t n ) 
• Otherwise, T(t) is the single vertex • 

We now have all the tools required to show that F n> i is the structure group of n-catalan 
algebras. 

Theorem 5.2.11. Struct G (C n ) = F nA . 

Proof. We denote the seed of p G Struct G (C n ), which exists by Lemma 5.2.9, by 
(s p , t p ). We claim that the following map is an isomorphism: 

Struct G (C7 n ) F n>1 

p [(T(s p ),T(tp),id)] 

It is routine to see that 6 is a homomorphism. Suppose that p, p' G Struct G (C n ) and 
that @(p) = @(p'). It follows that p and p' have the same seed, so p = p' and is faithful. 

By Lemma 3.1.6, in order to establish that is surjective, we need only show that 
ti =c„ t2 whenever ti,t2 G F®(V) and U(ti) = Ufa). By Corollary 5.2.7, we have 
ti =c n hnb(ti) =c n lmb(t2) =c„ so is surjective and, hence, an isomorphism. □ 

5.2.2. Symmetric Catalan Algebras and G n ,\- We saw in Section 5.1 that the 
leaves of a tree may be ordered by the lexicographic ordering on their addresses. An n- 
ary tree symbol [(Ti, T2, <r)] may thereby be viewed as a pair of tree diagrams, together 
with a permutation of the leaves of T±. Thus, in order to obtain an equational theory 
whose structure group is G Ut i we need to add the ability to arbitrarily permute variables 
in Catalan algebras. Recalling that the symmetric group is generated by transpositions of 
adjacent elements, we are led to the following definition. 
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Definition 5.2.12 (Symmetric n-Catalan Algberas). The theory of symmetric n- 
catalan algebras extends that of n-catalan algebras with the following equations, where 
1 < i < n: 

j x ii Xj+i, Xi+2) = ®{ x \ i x i+li x ii x i+2)- 

We denote the theory of symmetric n-catalan algebras by SC n . 

Symmetric n-catalan algebras essentially add an action of the symmetric group on 
the indices of <8>. In general, this is sufficient to induce an action of a symmetric group 
on the variables of any term in F^,(V). In the binary case, we recover the definition of 
commutative semigroups. 

Theorem 5.2.13. Struct G (5C n ) ^ G n>1 . 

Proof. For p <G Structc(5C n ), let (s p ,t p ) represent its seed, which exists by Lemma 
5.2.9. Since SC n is linear, s p and t p are linear and supp(s p ) = supp(t p ). Let ir(p) be the 
permutation of supp(s p ) induced by the permutation U(s p ) — > U(t p ). A similar argument 
to the proof of Theorem 5.2.11 establishes that the following map is an isomorphism: 

Struct G (C„) — > G n ,i 

p ^ [(T(s p ),T(t p ),7r(p))} 

□ 

We now know that F nj i and G n) \ are the structure groups of Catalan algebras and of 
symmetric Catalan algebras, respectively. We also know that if we can construct coherent 
categorifications of these algebras, then we can apply Theorem 3.2.6 to obtain presentations 
of these groups. In the following section, we set about the task of constructing a coherent 
categorification of Catalan algebras. 

5.3. Catalan categories and F n> i 

In order to obtain a presentation for Structc(C n ) and, hence, for F n< i along the lines of 
that provided by Dehornoy for F [Deh05], we need to obtain a coherent categorification 
of C n . The immediate problem is discerning a set of diagrams whose commutativity imply 
the commutativity of all diagrams generated by the categorification. As we shall see in this 
section, the following definition suffices for this purpose. While the coherence axioms that 
we have chosen may seem slightly cryptic, the reason for their choice will become apparent 
in the proof that the resulting categorification is coherent. We shall make frequent use of 
the following useful shorthand: For 1 < i < n and a morphism p : ti — > t' { , we set 

® < (p) = ®(lti,---,lt i _i,P. Vi, •••,!*„ )■ 

Definition 5.3.1. The rewriting 2-theory for n-catalan categories is denoted C n and 
consists of: 

• An n-ary function symbol ®. 



5.3. CATALAN CATEGORIES AND F„,i 

For 1 < i < n, an invertible reduction rule cti of the following form: 
a^ 1 ) : ®(xi > ®(xj+?) >a ^ 1 ) ^ OCxj- 1 ,®^- 1 ),^ 1 ) 



50 



Pentagon axiom: For 1 < i < n — 1, the following diagram commutes, where X = x\ 
and Z = z\ 



i-i 



n—i—l 



l+1 («n-l) 



(X,®( y r 1 ,®(y% l - 1 )),y2n,Z) 



l (a n _i-...-ai) 



Adjacent associativity axiom: For 1 < i < n — 2 , the following diagram commutes, 
where X = x 1 ^ 1 and Z = z™~ l ~ 2 : 

®{x,y 1 My n 2 +1 )My 2 n + + 2),z) 

®{xMyiUn^®{y 2 n£),z) 

®(x, vi, ®(®(z/2 +1 ), yl n +2 ), y2n+i,z) 

^My n i)MylriUl n n + \z) 

®{x, ®{yiMy2 +1 Wn+2 l ),ylr\ z) 
®{xM®W),y 2 n + \ l ),yl n n + \z) ^^n^l 

In the case where n = 2, the pentagon axiom reduces to Mac Lane's pentagon axiom 
for monoidal categories from Example 2.1.9 and the adjacent associativity axiom is empty, 
so we recover the usual definition of a coherently associative bifunctor. 

In the special case where n = 3, the adjacent associativity axiom leads to a single 
coherence axiom, illustrated by the following diagram. The other axioms given in this 
chapter may be unpacked in this special case in a similar manner. 
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®(Vl, ®(V2, 2/3, 2/4), <8>(2/5, ye, 2/7)) 



1,2/2, 2/3), 2/4, 0(2/5, 2/6, 2/7)) 



,2/2, 2/3), 0(2/4,2/5,2/6), 2/7) 




2, 2/3, 2/4), 2/5, ye), 2/7) 



I,0(y2,y3,y4),y5),ye,y7) 



0(0(0(yi, z/2, y3), y4, ys), ye, y7) -fr-"-"® 1 ^) 



We wish to apply Theorem 4.3.3 to C ra in order to show that it is a coherent cate- 
gorification of C n . In order to do this, we need to find a positive orientation of C n that 
is terminating and locally commuting-confluent. Let be the positive subtheory of ^ n 
that contains aij for < i < n. This is equivalent, as a rewriting theory, to the system 
introduced in the last section. Therefore, we know from Proposition 5.2.6 that is 
terminating. So, it remains to show that C^* is locally commuting-confluent. 

Lemma 5.3.2. C^* is locally commuting-confluent. 



Proof. By Lemma 4.2.6, we only need to show that every critical span in 
commuting-joinable. Suppose that Qj and ctj overlap, where 

-1 o/ + i+n-l> 



IS 



1, 



i+n\ j.2n—l 
+1 / ' H+n+1 

.2n-l 



L\ j.2n— 1\ 
)' L i+n )• 



Suppose first that ®(t\, 0(t-J™ ), t-+~|x) is an a,-reduct. For the overlap to be nontrivial, 
we must have j 7^ i. If j 7^ i =b 1, then the critical span arising from the overlap is 
commuting-joinable by naturality. If j = i ± 1, then the critical span arising from the 
overlap is commuting-joinable by the adjacent associativity axiom. If the overlap arises 
because 0(t*^™) is an aj-reduct, then there are two possibilities. If j 7^ n, then the critical 
span arising from the overlap is commuting-joinable by naturality. Otherwise, the critical 
span arising from the overlap is commuting-joinable by the pentagon axiom. The only 
other possible overlap arises when some t^ is an aj-reduct. In this case, the critical span 
arising from the overlap is commuting-joinable by naturality. □ 

Since C^* is terminating and locally commuting-confluent, we may apply Theorem 
4.3.3. 



Theorem 5.3.3. C n is a coherent categorification of C n . 



□ 
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With Theorem 5.3.3 in hand, we can obtain a presentation for F n i, which generalises 
the presentation for F given in [Deh05]. 

Corollary 5.3.4. S G (C n ) = F n>1 

Proof. By Theorem 5.3.3 and Theorem 3.2.6, we have S G (C n ) ^ Struct G (C n ). It 
follows then from Theorem 5.2.11 that 8 G (C n ) = F n \. □ 

In the following section, we shall obtain a coherent categorification of SC n and, thereby, 
a presentation of G n ,i- 

5.4. Symmetric Catalan categories and G n< i 

Our goal in this section is to construct a coherent categorification of symmetric Cata- 
lan algebras. The coherence theorem for Catalan categories, Theorem 5.3.3, reduces this 
problem to ensuring that any two sequences of transpositions of the objects appearing in a 
term realise the same permutation. In other words, our categorification needs to somehow 
encode a presentation of the symmetric group whose generators correspond to transposi- 
tions of adjacent variables. Such a presentation is well known, having been constructed by 
Moore [Moo96]. This presentation has generators Ti, . . . , T n _i and the following relations: 

if = 1 for 1 < i < n - 1 
(TiTi+i) 3 = 1 for 1 < i < n - 2 
(TjT k ) 2 = 1 for 1 < i < k - 2 

With this presentation in mind, we may now construct a reasonable categorification of 
SC n . Recall our shorthand that for 1 < i < n and a morphism p : ti — ► t'^ we have 

®*(p) = ®(l tl ,...,l ti _ 1 ,p, l ti+1 ,...,lt„). 

Definition 5.4.1. For n > 2, the rewriting 2-theory for symmetric n-catalan cate- 
gories, denoted SC n; is the extension of the theory for n-catalan categories with an invert- 
ible reduction rule Ti for 1 < i < n — 1 such that 

: ®(ii _1 )*i)^+l)*f+2) y ®(*i -1 )*i+l)*i)*i+2)) 

satisfying the following axioms: 

Involution axiom: For 1 < i < n — 1, the following diagram commutes: 

®(t?) 

i 

^ 7. ®{A~\ti + iit l ^ +2 ) 

Compatibility axiom: For 2 < i < n and 1 < j < n — 2, the following diagram commutes, 
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®(VF,®(x,y™ l ),y n ,Z) ®{W,x,®(y{ 1 ,y j+1 ,y j ,yJ +2 ),Z) 

3-cycle axiom: For \ < i < n — 2, the following diagram commutes: 




{t\ i U+l, U, i^|-2 




T»+l 



i— 1 / / + + n 



n+i 



1 j ti+2, ti+1, U, tf +3 ) 



n—i—1 



Hexagon axiom: For 1 < % < n — 1, the following diagram commutes, where W = w\ 1 
and Z = z" 

®{W,®{x J l),y,Z) 



®(W,j/,®(a;?),Z) 



0(W,^(y,x^ L ),x n ,Z) 



®(W,X!,®(x^,y),Z) 

5 I+1 (r„-i-...-r 1 ) 

®(W, Xl ,®(y,x%),Z) 



®(W,®(xi,y,x5 -1 ),a; n ,Z) 
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The hexagon axiom ensures that we may replace a transposition of the form Tj(t^ -1 , ®(-u™), t™ 1 ) 
with a sequence of transpositions involving only the terms and u™. One might posit 
the commutativity of a diagram that serves the same purpose for a morphism of the form 
Ti(t\, <8>(ui ), t™+i). Doing so leads to the dual hexagon diagram, which has the following 
form, for 2 < i < n and W = w{~ 2 and Z = z™" 1 : 

®(W,x,®(i/f),Z) 



®(w, z) ®(w, ®(x, <- 1 ), y n , Z) 

(g. i (r 1 -...-r„_i) 

®(W, yi, x), Z) ®{W, ®«-\x), y n , Z) 

Lemma 5.4.2. The dual hexagon diagram commutes in F(SC n ). 

Proof. Tracing around the dual hexagon diagram, we obtain the following morphism: 

(5.1) a; 1 ■ i+1 (r n _i) • Oi ■ ®\T! ■ . . . ■ th-i) -1 • a- 1 . 

In order to show that the dual hexagon diagram commutes, we need to show that (5.1) = Tj. 
By functoriality, we have: 

®*(Ti • • • • T^i)- 1 = ^(n)" 1 • . . . • &(T n ^)-\ 

By functoriality and the involution axiom, we have ®*(Tj) _1 = ® % {rj). From the com- 
patibility axiom, we also know that ® 1 (tj) = a~ x ■ ®* +1 (tj_i) ■ aj. It follows from these 
observations that: 

(5.2) (5.1) = a" 1 • ®*+Vn-i) • • • • • ■ " ■ ®Vi) ■ « _1 

(5.3) = ar 1 .® i + 1 (r n _i-...-ri)-a i -® i (ri)-ar 1 

(5.4) = a- 1 -® i+1 {r n - 1 -...-T 1 )-a i -® i {r 1 )- 1 -a-\ 

It follows from the hexagon axiom that (5.4) = T{. By the involution axiom, we then have: 

U ■ a' 1 • ® i+1 (m-i) • at ■ ®*(ti • . . . • r n _i) _1 • a' 1 = 1. 

Therefore, the dual hexagon diagram commutes in F(SC n ). □ 

In the n = 2 case, the axiomatisation of SC n reduces to the theory of a coherently 
associative and commutative bifunctor given in Example 3.2.7. The main result of this 
section establishes that SC„ is a suitable generalisation of this case. 
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Theorem 5.4.3. SC n is a coherent categorification of SC n . 

Proof. By Theorem 5.3.3 and Corollary 2.5.5, we may assume that all of the associa- 
tivity maps are strict equalities. Thus, an object of F(SC ra ) may be represented as 
where each ij is a variable and m = n + k(n — 1), for some k > 0. Lemma 5.4.2 and 
the hexagon axiom imply that it suffices to consider transpositions of adjacent variables. 
So, for a given object t := <S>(t™), we need only consider the m — 1 induced transposition 
natural isomorphisms 

Ti(t™) : <8>(ti -1 ,ii,ti+i,t™ 2 ) — ¥ ®(*i -1 )*i+i)*i)*i+2)- 

In order to establish coherence, we have to show that every permutation of t™ is unique. 
That is, we have to show that the induced transposition maps satisfy the defining relations 
for the symmetric group of order m. 

The compatibility axiom implies that each Tj is unique. By the naturality of the maps 
Tj, we have Tj • = • Tj for all 1 < % < k — 2. The involution axiom implies that Tf = 1. 
Thus, it only remains to establish that (Tj • Tj + i) 3 = 1. For n = 2, we may use the proof 
from Mac Lane [ML63]. Suppose that n > 3. Since the associativity maps are taken 
to be strict equalities, we may assume that t has the form ®(R,®(S,ti,ti+i,ti+2iU),V), 
where R, S, U and V are sequences of variables. The result then follows from the 3-cycle 
axiom. □ 

We can now construct a presentation of Struct^ (SC n ) and, therefore, of G rii \, which 
generalises the presentation for V given in [Deh05]. 

Corollary 5.4.4. S G (SC n ) ^ G n>1 

Proof. By Theorem 5.4.3 and Theorem 3.2.6, we have S G (SC n ) ^ Struct^ (SC n ). It 
follows then from Theorem 5.2.13 that S G {C n ) = G n>1 . □ 

In this chapter, we have seen how the seemingly abstract general coherence theorems 
developed in Chapter 4 can have very powerful applications. Indeed, the proof that C n is 
a coherent categorification of C n was relatively routine. Unfortunately, not all rewriting 
2-theories are of the form required for the theorems from Chapter 4 to be applicable. 
In the following chapter, we develop more general coherence theorems that relax those 
assumptions somewhat. 



CHAPTER 6 



Coherence for incomplete theories 

In Chapter 4, we developed a general Mac Lane coherence theorem for terminating 
and confluent rewriting 2-theories. This result has wide applicability, including the inves- 
tigation of Catalan categories presented in Chapter 5. 

Unfortunately, it is simply not the case that every coherent rewriting 2-theory has 
unique normal forms. For instance, the theory consisting of a unary function symbol F 
and the single reduction rule F(x) — > F(F(x)) is non-terminating, but easily seen to be 
coherent. A stronger counterexample to the hope that coherent structures have unique 
normal forms is provided by the theory of iterated monoidal categories [BFSV03] whose 
coherence problem we investigate in the following chapter. These structures arise as a 
categorical model of iterated loop spaces and fail to be confluent, so the tools of Chapter 
4 do not apply. 

We are thus faced with the problem of determining sufficient conditions for coherence 
in terms of the underlying rewriting system of a 2-theory that do not rely on either termi- 
nation or confluence. This leads to the related problem of determining whether, for any 
finitely presented labelled rewriting theory, there is always a finite set of diagrams whose 
commutativity implies the commutativity of all diagrams built from the theory. 

This chapter sets out to solve several related coherence questions by vigourously pur- 
suing the idea that two morphisms with the same source and target in a free covariant 
structure on a discrete category commute precisely when they admit a planar subdivision 
such that each face is an instance of naturality, or of functoriality or of one of the coher- 
ence axioms. The guiding intuition behind this approach is that a span that cannot be 
completed into a square can never appear in such a subdivision. 

Section 6.1 lays the foundations for this chapter by providing precise definitions of 
the various concepts related to subdivisions of parallel pairs of arrows and determining 
conditions that ensure that each parallel pair of arrows has only finitely many subdivisions. 
This quickly leads, in Section 6.2, to a general Lambek coherence theorem. Section 6.3 
provides a more refined analysis of the possible subdivisions of a parallel pair of reductions 
in a finitely presented labelled rewriting theory and exploits this analysis to obtain a 
general Mac Lane coherence theorem. Finally, Section 6.4 constructs examples of labelled 
rewriting theories that cannot be made coherent via only finitely many coherence axioms. 
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6.1. Subdivisions 

When one is working with rewriting 2-theories or categorical algebraic structures more 
generally, one typically draws diagrams representing morphisms in the free structure. The 
purpose of this section is to formalise these diagrams as ambient isotopy classes of pla- 
nar directed graphs. This provides a mathematical setting for the manner in which one 
typically shows that a parallel pair of morphisms is equal: by finding a subdivision of the 
pair whose faces commute by virtue of functoriality, naturality and the coherence axioms. 
Within this setting, we examine properties that the underlying rewriting theory of a 2- 
theory must satisfy in order to ensure that each parallel pair of morphisms admits only 
finitely many such subdivisions. This forms the basis for the coherence theorems developed 
in the remainder of the chapter. 

A subdivision of a parellel pair of reductions is, in the first instance, a collection of 
reductions having the same source and target. This collection forms a graphical structure. 

Definition 6.1.1. An st-graph is a labelled directed graph G (possibly with loops and 
multiple edges) together with two distinguished vertices u and v, called the source and target 
of G respectively, such that for any other vertex w € G, there exist paths u — > w and w — > v 
in G. 

By Lemma 2.6.3, we know that every reduction generated by a rewriting 2-theory is a 
composite of singular reductions. Before we introduce the graph associated to a labelled 
rewriting theory, we need to deal with a subtlety that arises due to the presence of an 
equational theory on terms. Let 7Z := (J 7 ; T \ £jr • £q-) be a labelled rewriting theory. 
By the functoriality of the function symbols F G T , every equation in [£jP\ induces an 
equation on reductions. Thus, we may form the quotient Sing(7£)/[£y]. We call a member 
of Smg(lZ) / £jr an absolutely singular reduction. 

Definition 6.1.2 (Reduction graph). Let Jzf := {T;7 \&t)x be a labelled rewriting 
theory. The expression Red(Jzf) denotes the reduction graph of Jzf. This graph has 

• Vertices: The set | ^ (X) . 

• Edges: Absolutely singular reductions in ¥^(X). 

The reduction graph of a rewriting 2-theory is the reduction graph of its underlying labelled 
rewriting theory. 

A subdivision corresponds to a particular way of embedding an st-graph in the oriented 
plane. Given a graph G, we use \G\ to denote its geometric realisation. We write M 2 for 
the plane with the clockwise orientation. We use G(s, t) to denote the set of paths from s 
to t in G. 

Definition 6.1.3. Let G be a graph and a,(3£ G(s,t). A pre-sub division of {a, (3) is 
a pair (S, (p) such that: 

(1) S is an st-graph with source s and target t. 
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(2) {a, 0} C5CG. 

(3) tp : \S\ ^> M 2 is a planar embedding. 

(4) For every edge 7 G S, the image ItI ) contained in the region ofM 2 bounded 
by (p(\a\) and <p{\P\). 

We use PSubc(a, (3) to denote the set of all pre- subdivisions of (a, [3) in G. 

The definition of pre-subdivisions admits too many different embeddings of the same 
graph. To this end, we define a usefui equivalence relation on pre-subdivisions. In the 
present context, we say that two embeddings /, g : G M 2 are ambiently isotopic if there 
is an isotopy h of the identity map of R 2 such that h\f = g. In other words, / and g are 
ambiently isotopic if they differ only by a continuous deformation of M 2 . Intuitively, / and 
g are ambiently isotopic when they differ only by the size and shape of their faces. 

Given a graph G and a, (3 G G(s,t), let (Si,(p) and (S2,ip) be pre-subdivisions of 
{a, (3). Define ~ to be the equivalence relation on PSubc(a,/3) generated by setting 
(Si, if) ~ (S 2 ,ip) if: 

(1) Si = S 2 . 

(2) cp and ip are ambiently isotopic. 

Ambient isotopy is still not quite enough to identify all subdivisions representing the 
same categorical diagram. The reason for this is that reflecting the plane about some axis 
maps a subdivision to an equivalent categorical diagram. Let E{2) be the Euclidean group 
of the plane — the group of all rotations, translations and reflections of the plane. 

We write Sub G (s,t) for the quotient (PSub G (s, t)/~)/E(2) . 

Definition 6.1.4. For a directed graph G and a, (3 G G(s,t), a subdivision of (a, (3) 
is a member of Subc(s,t). For a labelled rewriting theory jSf , a subdivision of a parallel 
pair of reductions a, (3 £ F(Jz?) is a subdivision of (a, (3) in Red(«5f). The set of all such 
subdivisions is denoted Sub^f (a, (3). 

Recall that a directed graph G is locally finite if G(s, t) is finite for all vertices s,t G G. 
The following sequence of lemmas establishes a correspondence between local finiteness 
and finitely many subdivisions. 

Lemma 6.1.5. For a directed graph G and a finite planar st-subgraph S < G(s,t) with 
source s and target t, there are only finitely many subdivisions of a, (3 G G(s, t) having 
graph S. 

Proof. Since we only consider embeddings of S up to ambient isotopy and Euclidean 
group action, a subdivision with graph S is completely determined by the set of edges 
mapped to the region bounded by v?(l7il) an d ( / 3 (l72|) for every parallel pair of paths 
7i,72 G S. Since S is finite, there are only finitely many possibilities for this. □ 

Lemma 6.1.6. An st-graph with source s and target t is finite if and only if it has 
finitely many planar st-subgraphs with source s and target t. 
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PROOF. (=>) A finite graph has finitely many subgraphs, so it certainly has finitely 
many planar subgraphs. 

(•<=) Suppose that G is an infinite st-graph with source s and target t. Each path 
from s to t in G determines a planar subgraph of G, hence G has infinitely many planar 
subgraphs with source s and target t. □ 

Combining Lemma 6.1.5 and Lemma 6.1.6, we obtain the desired correspondence. 

Lemma 6.1.7. If G is a directed graph containing vertices s andt, then G(s,t) is finite 
if and only if Subcia, (3) is finite for all a, (3 G G(s, t) □ 

6.1.1. Ensuring local finiteness. By Lemma 6.1.7, in order to ensure that every 
parallel pair of paths in a directed graph has finitely many subdivisions, we need only 
establish that the graph is locally finite. To this end, we make the following definition. 

Definition 6.1.8. Let G be a directed graph. A quasicycle in G is a pair (T,t) such 
that: 

(1) T is an infinite chain to — > t\ — > . . . in G 

(2) t is a vertex in G. 

(3) G contains a path ti — > t for all i G N. 



to 1\ s- t 2 >- t 3 >■ ti 




Figure 1 . A quasicycle 

Quasicycles earn their name by being a slightly weaker notion than a cycle. Figure 
1 gives an example of a quasicycle that is not a cycle. On the other hand, we have the 
following easy result. 

Lemma 6.1.9. Let C be a directed cycle and c be a vertex in C. Then, (C, c) is a 
quasicycle. □ 

For a directed graph G and a vertex s £ G, we use Outc(s) to denote the set {t € 
V(G) : G contains an edge s — ► t}. We say that G is finitely branching if Outc(s) is finite 
for all vertices s G G. One of our main technical tools is the following graphical version of 
Konig's Tree Lemma. 

Lemma 6.1.10. A finitely branching directed graph is locally finite if and only if it 
contains no quasicycles. 
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Proof. Let G be a labelled finitely branching directed graph. 

(=>) Suppose that G contains a quasicycle (T,t), where T = to ^ t\ — » . . . . If ti = t 
for some i G N then G(ti,tj) is infinite for all j > i. So, suppose that U ^ t for all i G N. 
Since U — > t for all i £ N, there must be infinitely many pairs (i,(3i), where i 6 N and 
(3i : tj — > t is a path that does not factor through for any j > i So, G(to, t) is infinite. 

(<=) Suppose that G(s,t) is infinite. Since Outc(s) is finite, it follows from the pigeon 
hole principle that there must exist some sq € Outc(s) and an edge ao : s — > sq such that 
G(so,t) is infinite. Continuing recursively, we obtain an infinite chain s ^> so si -i . . . 
such that G contains a path — > t for all i £ N. So, G contains a quasicycle. □ 

By making use of the reduction graph of a labelled rewriting theory, we can shift our 
terminology for directed graphs to labelled rewriting theories. 

Definition 6.1.11. A labelled rewriting theory jSf is quasicycle-free if every quasicycle 
in Red(j§? ) contains cofinitely many identity reductions. It is locally finite if Red(Jz? ) is 
locally finite and it is finitely branching if Red(Jz? ) is finitely branching. 

Recall that an equation s = t is called balanced if s and t contain precisely the same 
variables and it is called linear if it is balanced and each variable appears precisely once 
in each of s and t. 

Definition 6.1.12. A labelled rewriting theory ££ := (T;T\£p) is term-linear if 
contains only linear equations. 

A reduction rule a : [s] — > [t] is called non-increasing if Var(i) C Var(s). A labelled 
rewriting theory Jz? is non-increasing if every reduction rule in Jz? is non-increasing. 

Proposition 6.1.13. A finitely presented labelled rewriting theory is finitely branching 
if it is term-linear and non-increasing. 

Proof. Let Jz? := {J-;T \£p) be a finitely presented non- increasing term-linear la- 
belled rewriting theory. Without loss of generality, we may assume that T = {p}. Suppose 
that the vertex [s] in Red(Jzf ) is infinitely branching. Since p is non-increasing, there must 
be infinitely many terms si, S2, ■ ■ ■ € [s] containing the same number of unary and binary 
function symbols as p, such that each Sj contains a p-redex as a subterm. But this is 
impossible, since £? is linear. □ 

A labelled rewriting theory that is not term-linear may be infinitely branching, even if 
it is finitely presented and non-increasing. 

Example 6.1.14. Let Jz? be the labelled rewriting theory consisting of the binary func- 
tion symbol T , the equation s = F(s, s) and the reduction rule p : t — > t' . Then, in F(Jzf), 
we have: 

t = F(t, t) = F(F(t, t),t) = F(F(F(t, t),t),t) = ... 
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The reduction rule p induces maps from [t] to: 

(6.1) t', F(t', t), F(F(t', t),t), F(F(F(t' , t),t),t), . . . 

Since the terms in (6.1) are pairwise unequal, Jz? is infinitely branching. 

Lemmas 6.1.7 and 6.1.10 imply that a finitely branching quasicycle-free labelled rewrit- 
ing theory has only finitely many subdivisions for every parallel pair of reductions. A ready 
supply of such theories is provided by the following observation, which follows immediately 
from the definitions. 

Lemma 6.1.15. A terminating labelled rewriting theory is quasicycle-free. □ 

By Lemma 6.1.9, a quasicycle-free directed graph is acyclic. The following theorem 
establishes that every face of a subdivision in an acyclic graph is itself a parallel pair 
of paths. It was originally discovered by Power [Pow90] in his investigation of pasting 
diagrams in 2-categories. 

Theorem 6.1.16 (Power [Pow90]). A planar st-graph is acyclic if and only if every 
face has a unique source and target. □ 

Theorem 6.1.16 readily leads to the following result by induction over the number of 
faces in a subdivision. 

Proposition 6.1.17. Let 1Z := (F\T \Et ; £t) be an acyclic rewriting 2-theory and 
let a, (3 € Red(7£)(s, t). Then, the following statements are equivalent: 

(1) a = in¥(K). 

(2) There is a subdivision of (a, (3) in Red(7£)(s, t) such that each face commutes in 
F(K). 

(3) There is a subdivision of (a, (3) in Red(7£)(s, t) such that each face is either an 
instance of functoriality, or an instance of naturality or an instance of one of the 
equations in □ 

In the following section, we use the tools developed so far to tackle the Lambek coher- 
ence problem. 

6.2. Lambek coherence 

With Proposition 6.1.17 and Lemma 6.1.7, one may be inclined to think that a Lam- 
bek coherence theorem should be immediately forthcoming, since we know that every 
quasicycle-free finitely branching rewriting 2-theory has only finitely many subdivisions 
for each parallel pair of reductions and we can just check every face to see whether it is an 
instance of functoriality, naturality or a cohenrence axiom. There is, however, one catch 
- we may not be able to decide whether a given face is an instance of an axiom. 
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Definition 6.2.1 (Unification). Let J- be a ranked set of function symbols on a set X 
and £? be an equational theory on Wp(X). An -unification problem is a finite set: 

r = {(si,h), (s n ,t n )}, 

where for 1 < i < n, we have that Sj and U are in Fjr(X). A unifier for T is a homomor- 
phism o '. X — ► Fjf(X) such that cr(sj) =£ F o-(U) for all 1 < i < n. The set T is unifiable 
if it admits at least one unifier. 

Unification theory is an important technical component of automated reasoning and 
logic programming, as it provides a means for testing whether two sequences of terms are 
syntactic variants of each other. A good survey of the field is provided by [BS94]. In 
the case where the theory £p is empty, the unification problem is readily shown to be 
decidable (see [BS94] for details). Unfortunately, the equational unification problem is in 
general undecidable. 

Definition 6.2.2. A labelled rewriting theory {F;T \ ; Et) has decidable term uni- 
fication if (J 7 1 Ejr) has a decidable unification problem. 

We can finally establish a general Lambek coherence theorem. 

Theorem 6.2.3 (Lambek Coherence). A finitely branching quasicycle-free rewriting 
2-theory with decidable term unification is Lambek Coherent. 

Proof. Let 1Z be a rewriting 2-theory satisfying the hypotheses and let a, (3 G Red(7£)(s, t). 
By Lemma 6.1.7, we can enumerate the subdivisions of (a, (3). Since each subdivision has 
only finitely many faces and 1Z has decidable term unification, we may apply Proposition 
6.1.17 to determine whether every face of a subdivision commutes in ¥{1Z). □ 

Unfortunately, we may not be able to determine whether a labelled rewriting theory 
is quasicycle-free. 

COROLLARY 6.2.4. It is undecidable whether a finitely branching rewriting 2-theory 
theory with decidable term unification is quasicycle-free. 

PROOF. The rewriting 2-theory constructed in the proof of Theorem 2.4.6 has an 
empty equational theory on terms and so has decidable term unification. It follows from 
Theorem 6.2.3 that, were we able to determine whether the theory is quasicycle-free, then 
we would be able to decide whether a finite monoid presentation has a decidable word 
problem. □ 

As a particular application of Theorem 6.2.3, any terminating rewriting 2-theory with 
an empty equational theory on terms is Lambek coherent. This includes, amongst others, 
categories with a directed associativity [Lap72a]. The unification problem for an asso- 
ciative binary symbol <8) together with an identity I for (g> is decidable [BS94]. It follows 
then, from Theorem 6.2.3 that the following rewriting 2-theories are Lambek coherent (in 
each case we need only check that the 2-theory is terminating): 
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• Distributive categories with strict associativities and strict units [Lap72b]. 

• Weakly distributive categories with strict associativity and strict units [CS97]. 
An example of a non-terminating theory that is Lambek-coherent is provided by the 

system F(x) — > F(F(x)), since this is easily seen to be quasicycle-free. 

In the following section, we continue our investigation of quasicycle free theories and 
derive sufficient conditions for such a system to be Mac Lane coherent. 



The last section was concerned with deciding whether a given parallel pair of morphisms 
is equal or, equivalently, whether a given diagram in general position commutes. Our 
rough goal in this section is to find a minimal set of diagrams in general position whose 
commutativity implies the commutativity of all other such diagrams in W(1Z) for some 
rewriting 2-theory TZ. To this end, we define what it means for one subdivision to be finer 
than another. The driving idea is that we only wish to consider those subdivisions that 
do not embed into a finer subdivision. 

Definition 6.3.1. Let G be a directed graph and a, (3 G G(s,t) and (Si,(p),(S2,ip) G 
Sublet, (3). We say that (Si,ip) is coarser than (S^VO if there is a graph embedding 
A : S\ — > S2 making the following diagram commute. In this case, we also say that (S2, Y>) 
is finer than (S\,ip) and we write (Si,ip) ^ (S^VO- 



We define the refinement order to be the antisymmetric closure of ^ . 

It is immediate from the definitions that the set of subdivisions of a parallel pair 
of morphisms forms a poset under refinement. We shall abuse notation slightly in the 
following definition and write ^ for the refinement order. 

Definition 6.3.2. Let G be a directed graph and a, (3 G G(s, t). A maximal subdivision 

of (a, (3) is a maximal element of (Sub(j(a!, 0), ■<). 

The idea behind the definition of a maximal subdivision is that these are precisely the 
ones which cannot be further subdivided. This leads to the following lemma. 

Lemma 6.3.3. A finitely branching quasicycle-free rewriting 2-theory is Mac Lane co- 
herent if and only if every parallel pair of reductions in general position admits a maximal 
subdivision, each face of which commutes. 

Proof. The direction (<=) follows from induction over the number of faces. For the 
other direction, let TZ be a finitely branching quasicycle-free rewriting 2-theory. Let a, (3 G 
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Redfc(s,t). Since 7Z is quasicycle-free and finitely branching, it follows from Lemma 6.1.7 
that Subn(a, (3) is finite. Therefore, (a, (3) admits a maximal subdivision (S,(p). By 
Theorem 6.1.16, every face of (5, ip) has a unique source and target. Since 1Z is Mac Lane 
coherent, each of these faces commutes. □ 

In order to make Lemma 6.3.3 effective, we need to characterise those parallel pairs of 
morphisms that can occur as faces of a maximal subdivision. 

Definition 6.3.4 (Zig-zag subdivision). Let G be a directed graph and a, (3 E G(s,t). 
Suppose that 

ao a\ a-n-l a n 

a = s — > ao — > • • • — ► a n -i —> t 

n A) , |9l /3m- 1 , f3m , 

/3 = s -> b -> ■ • ■ -> 6 m _i -> t 

and that each cti and is singular. Let U be the forgetful functor from directed graphs to 
graphs that forgets the direction of edges. A zig-zag subdivision of {a, (3) is a subdivision 
(S,ip) of {a, (3) such that U(S) contains a path from U(ai) to U(bj) for some pair (i,j), 
with < i < n — 1 and < j < m — 1. We call the preimage of this path the zig-zag of S. 




Figure 2. A few zig-zag subdivisions. 



Definition 6.3.5 (Diamond). Let G be a directed graph. A pair a, (3 £ G(s,t) is called 
a diamond if it does not admit a zig-zag subdivision. 

The idea behind the definition of a diamond is that any subdivision containing a face 
that admits a zig-zag subdivision cannot be a maximal subdivision. This is made precise 
in the following proposition. 

Proposition 6.3.6. Let G be an acyclic directed graph and a,(3e G(s,t). Every face 
of a maximal subdivision of (a, (3) is a diamond. 

Proof. Let G be an acyclic directed graph and let (S, ip) be a maximal subdivision of 
a,(3e G(s,t). By Theorem 6.1.16, every face of S has a unique source and target. That 
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is, every face consists of a parallel pair of reductions r/, ip : u — > v. Suppose that (77, ip) is 

a face of S that is not a diamond. That is, it admits a zig-zag subdivision. So, we have 

m m 
rj = u — > w — > v 

ip = u — > x — > V , 

and a zig-zag 7 between u> and x that is a part of a subdivision of (77, ip). By maximality, 
7 must be contained in S. Since (rj,ip) is a face, </?(l7l) cannot lie in the region bounded 
by <p(\rj\) and <^(|Y>|). So, we are in one of the situations depicted in Figure 3. 





Figure 3. Possible embeddings of 7. 



Suppose that we are in the situation depicted in the left hand diagram of Figure 3. 
Since 7 is contained in S and since S is an st-graph, there is a path s — > u. By planarity, 
8 must factor through a vertex in 7 or 7/2 or ip2- If S factors through a vertex in 772 or ip2 
then it is clear that G contains a cycle, contradicting the fact that G is acyclic. So, we 
must have s z ^ ii for some vertex 2 in 7. However, since 7 appears in a subdivision 
of (rj,ip), there is a path u z in G. Then, (52 • C forms a cycle in G, contradicting the 
fact that G is acyclic. So, 7 cannot be embedded as in the left hand picture of Figure 3. 
Dually, it cannot be embedded as in the right hand picture of Figure 3. 

Therefore, the zig-zag 7 must be embedded within the face bounded by (77, ip), contra- 
dicting the maximality of (S,<p). So, (77, V>) must be a diamond. □ 



Combining Lemma 6.3.3 and Proposition 6.3.6, we obtain our general version of the 
Mac Lane Coherence theorem. 

Theorem 6.3.7 (Coherence). A finitely branching quasicycle-free rewriting 2-theory 1Z 
is Mac Lane coherent if and only if every diamond in Red(7£) commutes in ¥(1Z). □ 

Theorem 6.3.7 says that in order to show that a finitely branching rewriting 2-theory 
is Mac Lane coherent, we need to do two things: 

(1) Show that ¥(TZ) is quasicycle-free. 

(2) Show that every diamond commutes. 

At the outset, showing that every diamond commutes can be a daunting task. We can 
guide our investigations by exploiting the properties of critical spans. 
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Definition 6.3.8. Let 1Z be a rewriting 2-theory and let ip\ and ip% be singular mor- 
phisms in F(1Z). We call the source span in a diagram of the following form: 



i>2 



■•P2 



If ipi and ipi are singular, then there are three possibilities for a diamond with source 
span (<£>i, Y^i): 

(1) ipi and ipi rewrite disjoint subterms. 

(2) ipi and ip\ rewrite nested subterms. 

(3) ip i and ipi rewrite overlapping subterms. Without loss of generality, we may 
assume that {(fi,ipi) forms a critical span. 

By analogy with Lemma 4.2.6, one may hope to reduce the problem to only examining 
diamonds whose source span is critical. Unfortunately, as the following two examples show, 
there may be more than one diamond whose source span performs a given pair of nested 
or disjoint rewrites. 

Example 6.3.9. In this example we construct a terminating rewriting 2-theory that has 
more than one diamond with the same source span performing a nested pair of rewrites. 
Let 1Z be the 2-theory consisting of unary functor symbols /, J and H , together with the 
following reduction rules: 



I(x) -> J(x) 
I(J(x)) - H(x) 
J(I(x)) - H{x) 

Then, F(7£) contains the following diagram: 

/ ^\ 

/(/(a)) J(J(o)) H(a) 

J(J(o)) ^ 

Since there is no reduction J(J(a)) — > H(a), both parallel reductions form diamonds. 

Example 6.3.10. In this example we construct a terminating rewriting 2-theory that 
has more than one diamond with the same source span performing a disjoint pair of 
rewrites. Let TZ be the rewriting 2-theory consisting of unary functor symbols I and J, 
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the binary functor symbol ® and the following reduction rules: 



I(x) -> J(x) 
J{x)®I{x) -> H(x) 
I{x) ® J(x) -> 



Then, ¥(TZ) contains the following diagram: 



I{A)®J(A) 




J(A)®I(A) 



J(A)®J(A) tf(A) 



\ 




J{A)®I{A) 



Since there is no reduction J (A) <S> J (A) — > i2"(.A), 6ot/i parallel reductions form diamonds. 

Examples 6.3.9 and 6.3.10 serve to warn us that the collection of diamonds behaves 
a lot more subtly than the collection of spans, which are the typical objects of study in 
traditional term rewriting theory. In the next section, we look at when a labelled rewriting 
theory cannot be made into a Mac Lane coherent rewriting 2-theory by only finitely many 
coherence axioms. 



In light of Theorem 6.3.7, we have a reasonable strategy for determining whether 
a given rewriting 2-theory is Mac Lane coherent. However, we are still left with the 
problem of determining whether a given finitely presented labelled rewriting theory can 
be extended to a finitely presented Mac Lane coherent rewriting theory. Bearing in mind 
the results of the previous section, we have two reasonable candidates for ensuring this 
property: quasicycle freeness and termination. In this section, we show that neither of 
these conditions suffice in general. 

Given a rewriting 2-theory 1Z := LT\T\Ej:\ £q-), we say that the labelled rewriting 
theory (JF; T \ is the reduct of 1Z. 

Definition 6.4.1 (Finitely Mac Lane coherent). A finitely presented labelled rewriting 
theory is finitely Mac Lane coherent if it is the reduct of a finitely presented Mac Lane 
coherent rewriting 2-theory. 

It is not a priori obvious whether there exist theories that are not finitely Mac Lane 
coherent. We can simplify our investigation of this point somewhat by focusing on the 
most basic diamonds. 



6.4. Finite Mac Lane coherence 



Definition 6.4.2 (Basic diamond). A diamond Ai appearing in the reduction graph 
of a labelled rewriting theory is basic if it satisfies the following properties: 
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(1) For every substitution a and diamond A2, if Ai = A2, i/ten a is a variable 
renaming. 

(2) For ewery unary functor F € F(Jzf) and every diamond A2, if F(/S.2) = A\, then 
F = 1. 

The following theorem is immediate from Theorem 6.3.7. 

Theorem 6.4.3. ^4 finitely presented, finitely branching labelled rewriting theory Jz? is 
finitely Mac Lane coherent if and only if Red(Jz? ) contains finitely many basic diamonds, 
up to variable renaming. □ 

In the following example, we construct an example of a quasicycle-free labelled rewrit- 
ing theory that is not finitely Mac Lane coherent. 

Example 6.4.4. Let Jzf be the labelled rewriting theory containing unary function sym- 
bols F, G, L and H, together with the following reduction rules: 

I(x) - G(I(x)) 

I(x) -► F(I(x)) 
F(x) -► F{F(x)) 
G(x) -»• G(G(z)) 
F(x) -► H(x) 
G(x) -»• 

In order to s/iow f/iaf =Sf is quasicycle-free, it suffices to show that there is no term t £ F(Jz?) 
suc/i that there are infinitely many reductions with target t in F(Jzf ). Let Jz? -1 be the labelled 
rewriting theory with the same function symbols as Jz? and a reduction rule t s for every 
reduction rule s — ► t in Jz?. By Proposition 6.1.13, Jz? -1 is finitely branching, so Jz? is 
quasicycle-free. 

However, F(Jz? ) contains the following diagram: 

G(I(a)) — G 2 (L(a)) — G 3 (/(a)) — • • • 




/(a) H(L(a)) tf 2 (I(a)) tf 3 (/(a)) 




F(/(a)) — F 2 (/(a)) — F 3 (/(a)) — • • • 

Since there are no reductions H l (a) — > H 3 (a) for i ^ j, no finite collection of diamonds 
with source 1(a) implies the commutativity of all others. So, Jz? contains infinitely many 
substitution-reduced diamonds and it follows from Theorem 6.4-3 that it is not finitely Mac 
Lane coherent. 
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Example 6.4.4 works by constructing infinitely many substitution-reduced diamonds 
sharing a common source span. Terminating rewriting theories are far better behaved. 

Lemma 6.4.5. Let if be a finitely branching terminating labelled rewriting theory. 
Then, for every term t € F(if), the set 

Out(t) = {t' € F(if) : 3 a reduction t -> t' in Jz^} 

is finite. 

Proof. Let if be a finitely branching terminating labelled rewriting theory and let 
t £ F(if). Suppose that Out(i) is infinite. Let Out(t) n be the set of terms t' £ Out(i) 
such that a path of minimal length t — ► i' in F(if) contains n edges. Since if is finitely 
branching and Out(t) is infinite, Out(i) n is finite and nonempty for all n G N. This implies 
that if is not terminating, contradicting our assumptions. Thus, Out(t) is finite. □ 



It follows from the above lemma that there are only finitely many substitution-reduced 
diamonds with a given source span in a finitely branching, terminating labelled rewriting 
theory. One may be led by this observation to posit that such a theory is necessarily 
finitely Mac Lane coherent. However, there is still the possibility that there are infinitely 
many distinct substitution-reduced diamonds, since the diamonds may possess different 
source spans. This problem proves to be insurmountable, as demonstrated in the following 
example. 

Example 6.4.6. In this example, we construct a finitely branching, terminating labelled 
rewriting theory that is not finitely Mac Lane coherent. Let if be the labelled rewriting 
theory consisting of the following function symbols: 

• Nullary: W 

• Unary: S, S', T, T' 

• Binary: F 

together with the following reduction rules: 

F(F(a,b),c) F(a,b) 
F{S{a),b) -Z+W 

F(a,T(b)) -!U W 
S(a) S'(a) 
T{a) -U T'(a) 

Then, F(if ) contains the following infinite sequence of diamonds: 
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F(S'(a),T(a)) 




F{S{a),T(a)) W 




F(S(a),T'(a)) 



F(F(S>(a),b),T(c)) 




F(F(S(a),b),T'(c)) — — F(S(a),b) 



F(F(F(S'(a),b),c,),T(d)) 




F(F(F(S(a),b),c,),T(d)) 
I 

\ 

F(F(F(S(a),b),c,),T'(d)) 

F(F(S(a),b),c) 




Since no diamond in the above sequence is a substitution-instance of another, it follows 
from Theorem 6.4-3 that is not finitely Mac Lane coherent. 

In this chapter, we have developed very general tools for investigating coherence prob- 
lems in non-confluent and non-terminating rewriting 2-theories. In the following chapter, 
we apply these tools to a concrete theory arising in algebraic topology. 
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Iterated monoidal categories 

The coherence theorems developed in Chapter 6 are primarily useful for investigating 
non-confluent and/or non-terminating categorical structures. As we have seen previously 
in Chapter 4 a vast array of categorical structures suffer from neither of these deficiencies. 
This might lead one to suspect that any "natural" categorical structure is both confluent 
and terminating. Unfortunately this is not the case. In this chapter, we investigate the 
theory of iterated monoidal categories [BFSV03], which arise naturally as a categorical 
model of iterated loop spaces. This theory posesses two features making its coherence 
problem difficult: it has a non-trivial equational theory at the term level and it is non- 
confluent. A coherence theorem is developed in [BFSV03], which says that there is a 
unique map in an n-fold monoidal category between two terms without repeated variables. 
The proof proceeds via an intricate double induction on the number of variables and the 
dimension of the outermost tensor product in the target of a morphism. In this chapter, 
we exploit Theorem 6.3.7 to provide a more conceptually straightforward proof of this 
theorem. 

7.1. Definitions and basic properties 

An n-fold monoidal category contains n monoidal structures linked via "interchange" 
maps. The presentation given in [BFSV03] endows each tensor product with strict asso- 
ciativity and unit constraints. One of the interesting features of this structure is that the 
n tensor products all have the same unit. This fact, coupled with the equational theory on 
terms, allows for some unexpected interplay between the interchange maps. For instance, 
it is not immediately obvious that the structure is non-confluent. This section introduces 
n-fold monoidal categories and explores some of the subtleties that arise. 

Definition 7.1.1. The rewriting 2-theory for n-fold monoidal categories is denoted 
^ n and consists of the following. 

(1) n binary functor symbols: <8>i, • • • , <S> n 

(2) A nullary functor symbol I 

(3) For 1 < % < n: 

a ®i (b <S>i c) = (a (g>i b) <g>j c 
a <S>i I = a 
I <8>i a = a 
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(4) For each pair such that 1 < i < j < n, there is a reduction rule, called 

interchange: 

Va,b,c,d : ( a ®J ®* ( C ®J ^) — * (° ®* C ) ®? ®* ^) 

T/ie interchange rules are subject to the following conditions: 

(1) Internal unit condition: if^ b j j = rfj T ab = ida^j b 

(2) External unit condition: rf^j b j = ffj a Ib = id a ® i & 

(3) Internal associativity condition: The following diagram commutes: 



(a <8>j b) ®j (c ® j d) ®j (e % /) 



«4 ®j f, ®» *7"d,e,/ 



(a ®j b) ®i ((c ®i e) % (d <S>i /)) 



n J 

'a,b,c ®^ e,d (g>^ / 



((a ®i c) ®j (6 ®i d)) ®; (e ® j /) 



n 

'a (g>^ c,6 (g>^ d,e,f 



-s- (a ®j c ® j e) (g)j- (6 ®i d ®j /) 



(4) External associativity condition: The following diagram commutes: 



(a ® j- 6 (gij c) ® j (d (Sij e®j f) 



>a,b ®j c,d,e ® j / 



77 

'a (glj- 6,c,d ® j e,f 



((a ®j b) ®i (d ®j c)) g) j (c ®; /) 



id a 2li d ®j r)b, c ,e,f 

(a ®j d) ®j ((6 ®j c) <S>< (e ®j /)) >■ (a ®, d) % (6 ®, e) ® .,■ (c ®j /) 

(5) Giant hexagon condition: The following diagram commutes: 

((a ® k b) ®j (c ® fc d)) ®; ((e ® fc /) <g>j- (g ® fc /i)) 




((a ®j c) ® k (6 ®j d)) ®; ((e ®j g) ® fe (/ ®j h)) ((a ® fc 6) ®; (e ® fc /)) ®j ((c ® fc d) ® t (g ® fc /i)) 



77 lfe ®j J? 1 * 



((a <g> j c) ®; (e (gij g)) ® fc ((6 ®j d) ®* (/ ®j h)) ((a ®; e) ® fc (b ®j /)) ®j ((c ®i g) ® fc (d ®* /i)) 




((a ®i e) ®j (c ®i g)) ® fe ((6 ®; /) ® j (d ®j K") 

In i/ie giant hexagon, k) is such that \ <i<j<k<n and the labels have 
the evident components. 
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Since the terms appearing in each reduction rule of *dt n are linear, we immediately 
obtain the following lemma. 

Lemma 7.1.2. A reduction [s] — > [t] in F(^# ra ) is in general position if and only if s 
and t contain no repeated variables. □ 

Because of the fact that an n-fold monoidal category is strictly associative and has a 
strict unit, we can derive various maps via Eckmann-Hilton style arguments. Two of these 
maps will be of particular use to us. In the following, we assume that is such that 

1 < i < J < n. The derived maps are as follows: 

ij 
a,b 

(1) Dimension raising: a <g)j b <£>j b. This represents the following composi- 
tion: 

a®ib ^(a(Sj I) (Si (I (Sjb) (a ® ; I) <g>j (J (S i b) >-a®jb 



% 3 
T , 

(2) Twisted dimension raising: a (E>i b — ^b a. This represents the following 
composition: 

a (Sib (Sj a) (Si (I (Sj b) ®i b) (Sj (a (Si I) *-b(Sj a 

With the above maps, it is easy to see that iterated monoidal categories do not have 
unique normal forms. 

Lemma 7.1.3. If n > 2, then F(^# n ) is not confluent. 

PROOF. The following span is not joinable: 

, in 
L a,b 

a (Sib a(S n b 

in 
T a,b 

b® n a 

□ 

In the following section, we tackle the coherence problem for <^ n . 

7.2. Proving coherence 

Our first step in investigatng the coherence problem for iterated monoidal categories 
is to bring them into the realm of applicability of Theorem 6.3.7. 

PROPOSITION 7.2.1. Jt n is quasicycle-free. 

Proof. Let J^n 1 be the rewriting 2-theory that arises by replacing the reduction rules 
rf 3 in with the following reduction rules, where 1 < i < j < n: 

Ca,b,c,d '■ ( a ®» c ) ®j (b <8>i d) (a <8)j b) ®i (c ®j d). 
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Given an object [s] G F( 1 ^#^ 1 ), we may assume that s contains no instances of /. It follows 
that if there is a reduction [s] — > [i] in F(^#^ 1 ), then i contains the same variables as s, 
as well as the same number of function symbols. Since there are only finitely many such 
possibilities, is quasicycle-free. □ 

Let t be a term in F(^# n ). For a set X C Var(i), we write t — X to denote the term 
resulting from substituting I for each variable in X. For instance (a®ib)<g)j(c<S)id) — {b, d} = 
a <g)j c. We say that a term u is in a term t and write u G t if there is some X C Var(t) 
such that i — X = u. Of crucial importance to us is the following result of [BFSV03]. 

Theorem 7.2.2 ([BFSV03]). Let t and u be terms in W(j£ n ). A necessary and 
sufficient condition for the existence of a reduction t — > u in ¥(^ n ) is that, for each 
a,b G Var(i), if a <8>i b G t, then one of the following holds: 

• There is some j > i such that a®j b € u 

• There is some j > i such that b®j a G u 

Theorem 7.2.2 gives us the technical tool that we need in order to show that various 
parallel pairs of maps are not diamonds. We begin our analysis of the collection of diamonds 
of ¥(^ n ) with diamonds whose source span rewrites disjoint subterms. 

Lemma 7.2.3. Let a <S>i b G F(^# ra ) and suppose that there are maps <p : a — > a' and 
ip : b — > b' . Then, in the following diagram, the square labelled (d) is a commutative 
diamond and there is a map a' ®ib' — > c : 



a®ib' 




Proof. The square labelled (d) commutes by functoriality and it is easy to see that 
it does not admit a zig-zag subdivision, so it is a diamond. The tricky part is showing the 
existence of a map a! <S>i b' — > c. 

Let x, y G Var(a' <8>i b') and suppose that x (S>k V G a' ®i b' . There are a few cases to 
consider. 

• If x, y G a', then a implies that there is some m > k such that x ® m y G c or there 
is some m > k such that y ® m x G c. 

• If x, y G b', then implies that there is some m > k such that x ® m y G c or there 
is some m > k such that y (g> m x G c. 

• If x G a' and y G b', then x ®« y G a' (g> b. So, by a, there is some m > i such that 
£ ®m y G c or there is some m > i such that y <S> m x G c 
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Putting all of the above facts together, it follows from Theorem 7.2.2 that there is a map 



kb' 



c. 



□ 



Next, we investigate diamonds whose initial span rewrites nested subterms. For a term 
a and a subterm b < a, we write a{b} to represent this nested term. 

Lemma 7.2.4. Let a{b} e W(j£ n ) and suppose that there are maps 92 : a{b} — > a'{6} 
and ip : b ^> b' . Then, in the following diagram, the square labelled (d) is a commutative 
diamond and there is a map a'{b'} — > c: 

a{b'} 




a{b} (d) a'{b'} 



<P \ / a'{ip} 
a'{b} 



PROOF. The square labelled (d) commutes by naturality. The rest of the proof is 
similar to that of Lemma 7.2.3. □ 

We now know that source spans of the only remaining diamonds in F(^# n ) rewrite 
overlapping terms. 

7.2.1. Interchange + associativity. Let j > i. The first way in which interchange 
and associativity can interact is in the term X <g)j (c <g>j d) <8>i (e <8>j /). Without loss of 
generality, we may assume that X = a <S)j b, because we could always take X = X <g) j I. 
The resulting span then gets completed into the internal associativity axiom. One may 
then apply Theorem 7.2.2 to show that there is no other diamond with the same initial 
span. 

The second way in which interchange can interact with associativity is in the term 
(a 0j b) <8)j (c (8>j d <8>j e). In this case, we get the following square, where the labels have 
the evident components. 



(a®ic)®j(b®i(d®je)) 



The above square commutes by substituting (a<8>j I ®j b) <g>j (c<8>j d(S>j e) for the source 
and using the external associativity axiom. Theorem 7.2.2 easily yields that there can be 
no other diamonds with the same initial span. 

Similarly, a critical span arises at (a <8>j b <S>j c) (8>i (<i <8>j e). The analysis is similar to 
the previous case by inserting a unit to obtain (a ®j 6 CSj c) <8>i (d <8>j / ®j e) . 
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7.2.2. Interchange + interchange. Let i < j < k. An overlap between interchange 
rules occurs at (a ®j b) <8>j ((c <8>fc c2) <S>j (e <S>j /))• Since we have strict units, we may assume 
that a = a,\ ® k at and b = b\ ® k &2- We then obtain the initial span of the giant hexagon 
axiom. The hexagon forms a diamond and it follows from Theorem 7.2.2 that there are 
no other diamonds with this initial span. 

7.2.3. Interchange + units. The critical spans arising from the interaction of in- 
terchange with units yield the various Eckmann-Hilton maps. As we have seen, these are 
not always joinable. When they are, they commute by the following lemma. 

Lemma 7.2.5. The following diagrams commute in F(^# n ), where \ <i<j<k<n: 




PROOF. This follows from [BFSV03, Lemma 4.22]. More explicitly it follows from 
the giant hexagon axiom by making the following substitutions: 

(1) a <8>i b = ((a ® k I) ®j (I ® k I)) ®i ((/ ® k I) ®j {I ® k b)) 

(2) a®ib= ((I ® k I) ®j {a ® k I)) ®i {(I ® k b) ®j {I ® k I)) 

(3) a®ib= ((I ® k a) ®j (I ® k I)) ®i ((I ® k I) ®j (b ® k I)) 

(4) a ®i b = ((I ® k I) ®j {I ® k a)) ®i {(b ® k I) ®j {I ® k I)) 

□ 

7.2.4. Putting it all together. We have seen that F(«y# n ) is quasicycle-free and that 
every diamond in F(^# ra ) commutes. We can therefore apply Theorem 6.3.7 to obtain the 
coherence theorem for iterated monoidal categories. 

Theorem 7.2.6. // a and b are terms of F(^# n ) having no repeated variables, then 
there is at most one reduction a — > b in F(^# ra ). □ 

Theorem 6.3.7 provided a valuable strategy for proving the above coherence theorem 
for iterated monoidal categories. Although some careful combinatorial investigations were 
still required, the overall proof is conceptually straightforward. This demonstrates that 
even reasonably complicated coherence problems for quasicycle-free rewriting 2-theories 
may be comparatively easily attacked with the tools from Chapter 6. 
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Conclusion 

We have developed rewriting 2-theories as an abstract framework for studying coher- 
ence problems for covariant categorical structures. While general coherence theorems have 
been developed previously for certain classes of covariant structures [Pow89, Lac02], the 
work has typically been at an abstract categorical level and so does not yield any tech- 
niques for constructing specific coherence diagrams. More recent work on this problem has 
yielded an approach to obtaining coherence axioms for invertible theories [FHK]. How- 
ever, the coherence axioms chosen in [FHK] are all of the diagrams in general position. 
Certainly, this vastly over-axiomatises most theories and the authors in [FHK] note: 

"It is not clear what general scheme would select coherence diagrams 
"correctly" in accordance with what one expects for specific examples of 
algebraic structures known." 

The work in Chapter 4 on complete rewriting 2-theories and in Chapter 6 on quasicycle- 
free rewriting 2-theories does, however, provide a general scheme for selecting coherence 
diagrams in accordance with what one would expect for particular algebraic structures. 
For complete rewriting 2-theories, one only needs to select a joining of each critical span 
in order to obtain a complete set of coherence axioms. For quasicycle-free rewriting 2- 
theories, a complete set of coherence axioms is provided by the basic diamonds. It is, 
however, generally more difficult to construct coherence axioms in the quasicycle-free case. 
This is demonstrated by the intricate investigation required in Chapter 7 for iterated 
monoidal categories as opposed to the relatively straightforward investigation of Catalan 
categories in Chapter 5. 

Our combinatorial approach to coherence has the additional benefit of retaining a close 
link to classical one-dimensional universal algebra. This has allowed us, in Chapter 3, to 
use coherent categorifications of balanced equational theories to build presentations of 
the associated structure monoids and groups. Combined with our general techniques in 
chapters 4 and 6 for constructing coherence axioms, this provides a powerful toolkit for 
developing presentations of groups and monoids. This combination came to the fore in 
Chapter 5, where we constructed new presentations for the higher Thompson groups F n \ 
and the Higman-Thompson groups G nj ±. 

In the following section, we outline some additional questions raised by the thesis. 
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Further questions 

Both of our general coherence theorems for incomplete theories, Theorem 6.2.3 and 
Theorem 6.3.7, rely on the underlying structure being quasicycle-free. One may well call 
this condition into question and wonder whether we can get away with a weaker condition. 
For Lambek coherence, quasicycle-freeness does not capture all covariant structures known 
to be Lambek coherent. For example, braided monoidal categories are certainly not quasi- 
cycle free and yet their Lambek coherence problem is solvable via the Reidemeister moves 
[JS93]. However, employing the Reidemeister moves adds an additional rewrite system to 
the reductions, thus expanding the amount of information available. 

Question 1. Are there general properties that ensure Lambek coherence for non quasicycle- 
free rewriting 2-theories? 

The reliance on quasicycle-freeness for Mac Lane coherence seems more fundamental. 
However, two crucial ingredients of our theory rely predominantly on acyclicity: Theorem 
6.1.16 establishes that the faces of a subdivision are themselves si-graphs, while Proposition 
6.3.6 shows that the faces of a maximal subdivision are diamonds. 

Question 2. What conditions on an acyclic rewriting 2-theory ensure Mac Lane co- 
herence ? 

It seems likely that acyclic rewriting 2-theories in which every diamond commutes are 
Mac Lane coherent. The major obstruction to showing this is that maximal subdivisions 
are no longer guaranteed to exist. 

As noted in Chapter 2, rewriting 2-theories with an empty set of coherence axioms 
correspond to the unconditional fragment of rewriting logic. Meseguer has shown a strong 
connection between rewriting logic and models of concurrency [Mes92]: the congruence 
classes of terms correspond to states of the system, while reductions correspond to pro- 
cesses. A parallel pair of reductions that are equal correspond in this framework to a truly 
concurrent pair of processes. That is, they correspond to a pair of processes that may 
be safely run in parallel. In this way, one may view coherence axioms as specifications 
that certain parallel pairs of processes that seemingly interact with one another may in 
fact be safely run in parallel. In this way, the Lambek coherence problem asks about the 
existence of a decision procedure for determining which processes may be safely run in 
parallel. This computational interpretation of the Lambek coherence problem motivates a 
more refined investigation of decision procedures for the commutativity of diagrams arising 
from rewriting 2-theories. 

Question 3. What is the computational complexity of deciding whether a diagram 
commutes in the structure generated by a Lambek coherent rewriting 2-theory? 

As we saw in Chapter 2, determining whether a finitely presented rewriting 2-theory 
is Mac Lane coherent is in general undecidable. However, this does not rule out the 
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possibility of developing algorithms for tackling Mac Lane coherence. Indeed, there exist 
many successful algorithms for determining whether a term rewriting theory is terminating, 
even though this problem is also undecidable in general — a powerful such algorithm is 
provided by the dependency pairs method [ArtOO, HM05]. 

Project 4. Develop algorithms for constructing coherent categorifications of labelled 
rewriting theories and for determining whether a given rewriting 2-theory is Mac Lane 
coherent. 

A finite presentation of a coherent categorification of an equational theory leads to an 
infinite set of singular morphisms. Thus, the presentations constructed in Chapter 3 yield 
an infinite presentation of the associated structure monoid or group. This presentation has 
the nice property of imbueing the orbit graph of the resulting monoid or group with the 
geometry of the categorical structure. However, many structure groups are in fact finitely 
presentable. In particular, this is the case for the groups F„ i and G n ,i- 

Question 5. Are there properties of a coherent categorification of an equational theory 
that imply the finite presentability of the associated structure monoid or group? 

Our investigations have been wide-ranging, touching on topics from category theory, 
computer science, universal algebra and group theory. This has allowed us to to use 
computational insights to prove theorems about mathematical objects, to construct general 
coherence theorems that yield information about the actual coherence diagrams and to 
build presentations of groups using very general techniques. Hopefully future work will 
continue to exploit techniques across traditional subject boundaries, so as to illustrate 
connections and to foster dialogue. 
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